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Stationary Determinantal Processes: 
Phase Multiplicity, Bernoullicity, 
Entropy, and Domination 

by Russell Lyons and Jeffrey E. Steif 



Abstract. We study a class of stationary processes indexed by Z that are 
denned via minors of (i-dimensional (multilevel) Toeplitz matrices. We obtain 
necessary and sufficient conditions for phase multiplicity (the existence of 
a phase transition) analogous to that which occurs in statistical mechanics. 
Phase uniqueness is equivalent to the presence of a strong K property, a 
particular strengthening of the usual K (Kolmogorov) property. We show 
that all of these processes are Bernoulli shifts (isomorphic to i.i.d. processes 
in the sense of ergodic theory) . We obtain estimates of their entropies and we 
relate these processes via stochastic domination to product measures. 
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§1. Introduction. 

Determinantal probability measures and point processes arise in numerous settings, 
such as mathematical physics (where they are called fermionic point processes), random 
matrix theory, representation theory, and certain other areas of probability theory. See 
Soshnikov (|2000| ) for a survey and Lyons (|2002a|) for additional developments in the discrete 



case. We present here a detailed analysis of the discrete stationary case. After this paper 
was first written, we learned of independent but slightly prior work of Shirai and Takahashi 
( |2003|) , announced in Shirai and Takahashi (|2000|) . We discuss the (small) overlap between 



their work and ours in the appropriate places below. See also Shirai and Takahashi (|2002| ) 
and Shirai and Yoo (|2003|) for related contemporaneous work. 



Stationary determinantal processes are interesting from several viewpoints. First, they 
have interesting relations with the theory of Toeplitz determinants. As in that theory, the 
geometric mean of a nonnegative function /, defined as 



GM(/) :=exp J log/ 



will play an important role in some of our results. (In fact, the arithmetic mean and the 
harmonic mean of / will also characterize certain properties of our processes.) Second, such 
processes arise in certain combinatorial models, such as uniform spanning trees and dimer 
models. Third, these systems have a rich infinite-dimensional parameter space, consisting, 
in the case of a Z d action, of all measurable functions / from the <i-dimensional torus 
to [0,1]. We illustrate the variety of possible behaviors with examples throughout the 
paper. Fourth, they have the unusual property of negative association. Though unusual, 
negative association occurs in various places and a fair amount is known about it (see Joag- 
Dev and Proschan ( PSBD , Pemantle (gMj). Newman ([T5S§, Shao and Su (|OT|), Shao 



( |2UUU ), Zhang and Wen ( |2UU1| ), Zhang ( |2UU1 ), and the references therein, for example). 



We offer a whole new class of examples of negatively associated stationary processes. As 
such, determinantal processes provide an easy way to construct examples of many kinds 
of behavior that might otherwise be difficult to construct, such as negatively associated 
(stationary) processes with slow decay of correlations, or with the even sites independent 
of the odd sites (in one dimension, say), or with the property of being finitely dependent. 
Fifth, all our processes are Bernoulli shifts, i.e., isomorphic to i.i.d. processes. This may 
be surprising in view of the fact that only measurability, rather than smoothness, of the 
parameter / is required. Sixth, in one dimension, some of the processes are strong K, while 
others are not. Namely, strong K is equivalent to /(l — /) having a positive geometric 
mean. Similarly, we characterize exactly, in all dimensions, which / give strong full K 
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systems. It turns out that not only the rate at which / approaches or 1 matters, but also 
where. For example, in two dimensions, if / is real analytic, then the system is strong full 
K iff the (possibly empty) sets / _1 (0) and belong to nontrivial algebraic varieties. 

The strong full K property is analogous to phase uniqueness in statistical physics, as we 
explain in Section [<]. 

We now state our results somewhat more precisely and present several examples. Let 
/ : T d — > [0, 1] be a Lebesgue-measurable function on the (i-dimensional torus T d := M d /Z d . 
Define a Z d -invariant probability measure P-^ on the Borel sets of {0, 1} Z by defining the 
probability of the cylinder sets 

P / [r 7 (e 1 ) = 1, . . . , 77(e fc ) = 1] := P f [{rj G {0, if* ; 7]{e x ) = 1, . . . , V (e k ) = 1}] 

:= det[/(e_j - e^Ki^x^ 

for all ei, . . . , G Z d , where / denotes the Fourier coefficients of /. We shall prove 
in Section [2] that this does indeed define a probability measure. Note that when d = 1 
and when ei,...,efc are chosen to be k consecutive integers, the right-hand side above 
is the usual k x k Toeplitz determinant of / denoted Dk-i(f)- In particular, we have 
[77(e) = 1] = /(O) = J Jd f for every e G Z d . 

Example 1.1. As a simple example, if / = p, then P-^ is i.i.d. Bernoulli(p) measure. 

Example 1.2. For a more interesting example, consider 

sin 2 the 

f(x, y) := — — -2 ■ (1-1) 

sin 7ix + sin Try 

A portion of a sample of a configuration from P-^ is shown in Figure [1], where a square with 
upper left corner is colored black iff r)(i,j) = 1. These correspond to the horizontal 
edges of a uniform spanning tree in the square lattice. That is, if T is a spanning tree of 
Z 2 , then r)(i,j) is the indicator that the edge from to (i + belongs to T. The 
portion of the spanning tree from which Figure |1] was constructed is shown in Figure 0. 
When T is chosen "uniformly" (see Pemantle ( |1991[ ), Lyons ( |1998| ), or Benjamini, Lyons, 
Peres, and Schramm (|2001|) for definitions and information on this), then 77 has the law 
PA This follows from the Transfer Current Theorem of Burton and Pemantle (|1993| ) and 
the representation of the Green function as an integral; see Lyons with Peres (|2003| ) for 
more details. Similarly, the edges of the uniform spanning forest (it is a tree only for d < 4, 
as shown by Pemantle ( |1991|) ) parallel to the xi-axis in d dimensions correspond to the 
function 
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We remark that the uniform spanning tree is the so-called random-cluster model when one 
takes the limit q j 0, then p j 0, and finally the thermodynamic limit, the latter shown to 
exist by Pemantle (|1991| ). 




Figure 1. A sample from of |(l.f )| . 



Figure 2. A uniform spanning tree. 



Example 1.3. Let 



g(x) :-- 



sm7rx 



+ sm nx 

Then the edges of the uniform spanning tree in the plane that lie on the x-axis have the 



law P s , as shown in Example |5.5| below. 
Example 1.4. Let 



„ . 1 I sin27rx| — 1 

tlx) ■= — + 

JK 1 ' 2 2 cos 2nx 



An elementary calculation shows that 



( 1/2 




/(*) 



(fc-l)/2 



-1) 



(fc-l)/2 



1 2 v v-A' 

+ z E 



(-1)- 



7T " 27 + 1 



if k = 0, 

if 7^ is even, 



r if is odd. 



1 



We shall show in Example |5.21| that the measure P-^ arises as follows. Given a spanning 
tree T of the square lattice, let 77(71) be the indicator that e n G T, where e n is the edge 

[(n/2, 7i/2), (n/2 + 1, n/2)] if 71 is even, 

[((n + l)/2, (77 - l)/2), ((n + l)/2, (77 + l)/2)] if 77 is odd. 
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The collection of edges {e n ; n £ Z} is a zig-zag path in the plane. The law of r\ is P-^ when 
T is chosen as a uniform spanning tree. (Although it is not hard to see that the law of rj 
is Z-invariant, by using planar duality, and although the law of r\ must be a determinantal 
probability measure because the law of T is, it is not apparent a priori that the law of rj 
has the form P-^ for some /.) 

Example 1.5. Fix a horizontal edge of the hexagonal lattice (also known as the honeycomb 
lattice) and index all its vertical translates by Z. If one considers the standard measure 
of maximal entropy on perfect matchings of the hexagonal lattice, also called the dimer 
model and equivalent to lozenge tilings of the plane, and looks only at the edges indexed 
as above by Z, then the law is P-^ for / := 1 [1/3,2/3], as shown by Kenyon ( |1997| ). 



Example 1.6. It is interesting that the function / := 1 [0,1/2] f° r d = 1 also arises from a 
combinatorial model. In this case, 

if n = 0, 

f(n) = { if n 7^ is even, 

if n is odd. 




The measure P-Ms the zig-zag process of Johansson ( p002[) derived from uniform domino 



tilings in the plane. For the definition of "uniform" in this case, see Burton and Pemantle 
( P-993|) . A picture of a portion of such a tiling is shown in Figure |3|. Consider the squares on 



a diagonal from upper left to lower right. The domino covering any such square also covers 
a second square. If this second square is above the diagonal, then we color the original 
square black, as shown in Figure f|. Johansson ( |2002| ) showed that the law of this process 



is P-^ when the diagonal squares are indexed by Z in the natural way. More generally, 
the processes P-^ for / the indicator of any arc of T are used by Borodin, Okounkov, and 
Olshanski ( |20Q0| ) , Theorem 3, to describe the typical shape of Young diagrams. 



Example 1.7. Let < a < 1 and d = 1. If f(x) := (1 - a) 2 /\e 27Tix - a\ 2 , then ~P f is a 
renewal process (Soshnikov, |2000| ). The number of 0s between successive Is has the same 
distribution as the number of tails until 2 heads appear for a coin that has probability a 
of coming up tails. More explicitly, for n > 1, 

pf [77(1) = . . . = v (n - 1) = 0, 77(71) = 1 I 7/(0) = 1] = n(l - a) 2 a n_1 . (1.3) 

Since 



1 + a V 1 - ae 27Tix 1 - ae~ 27Tix 
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Figure 3. A uniform domino tiling, 
expansion in a geometric series shows that 



/(*) 



Figure 4. A sample from of Example 1.6 



1 — a 
1 + a 



We prove that P-^ is indeed this explicit renewal process after we prove Proposition |2.10 
in which we extend this example to other regenerative processes. 

Example 1.8. If < p < 1 and / : T d — > [0, 1] is measurable, then a fair sample of ~P p f 
can be obtained from a fair sample of P-^ simply by independently changing each 1 to a 
with probability 1 — p. 

For general systems, note that the covariance of 77(0) and r)(k) for k G Z d is — \ f(k)\ 2 . 
This is summable in k since / G L 2 (T d ), but that is essentially the most one can say 
for its rate of decay. That is, given any (a&) G £ 1 (Z d ), there is some (even continuous) 
/ : T d — > [0, 1] and some constant c > such that \ f{k)\ 2 > c\ak\ for all k G Z d , as shown 
by de Leeuw, Katznelson, and Kahane ( p.977 ) . Observe also that as is the case for Gaussian 
processes, the processes studied here have the property that if the random variables are 
uncorrelated, then they are mutually independent. 

It is shown (in a much more general context) by Lyons ( |2002aj) that these measures, 
as well as these measures conditioned on the values of 77 restricted to any finite subset of 
Z d , have the following negative association property: If A and B are increasing events that 
are measurable with respect to the values of rj on disjoint subsets of Z d , then A and B are 
negatively correlated. 
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In the order presented in this paper, our principal results are the following. 
• For all /, the process P-^ is a Bernoulli shift. This was shown by Shirai and Taka- 
hashi ( |2003| ) for those / such that Yl n >i n l/( n )| 2 < oo by showing that those P-^ are weak 



Bernoulli. 

• For all /, the process P-^ stochastically dominates product measure P GM( ^) and is 
stochastically dominated by product measure p 1 - GM ( 1 -/) 5 an d these bounds are optimal. 
This is rather unexpected for the process of Example |1.2| related to the uniform spanning 
tree; explicit calculations are given below in Example |5.13| for this particular process. We 
give similar optimal bounds for full domination (uniform insertion and deletion tolerance) 
in terms of harmonic means. 

• We present methods to estimate the entropy of PA For example, we show that 
for the function g in Example |1.3| , the entropy of the system P 9 lies in the interval 
[0.69005, 0.69013]; see Example 



• The process P-^ is strong full K iff there is a nonzero trigonometric polynomial T 
such that j\jzrf) £ L (T ). This is equivalent to phase uniqueness in the sense that no 
conditioning at infinity can change the measure. 

• In one dimension, P* is strong K iff /(l — /) has a positive geometric mean. This 
is equivalent to phase uniqueness when conditioning on one side only. Higher-dimensional 
versions of this will also be obtained. 

We shall give full definitions as they become needed. Some general background on 
determinantal probability measures is presented in Section 0, where we also exhibit a key 
representation of certain conditional probabilities as Szego infima. The property of being a 
Bernoulli shift is proved in Section |3|, while the auxiliary result that all P^ have full support 
(except in two degenerate cases) is shown in Section f|. Properties concerning stochastic 
domination are proved in Section |5|. These are used to estimate entropy in Section ^. More 
sophisticated methods of estimating entropy are also developed and illustrated in Section |[ 
The main result about phase multiplicity is proved in Section |7|, while the one-sided case 
for d = 1 and its higher-dimensional generalizations are treated in Section |8|. Finally, we 
end with some open questions in Section |j. 

The above definitions can be generalized to any countable abelian group with the 
discrete topology. For example, if we use Z x Z2, we obtain nontrivial joinings of the 
above systems with themselves. That is, suppose / : T — > [0, 1] and h : T — ► [0, 1] are 
such that f ± h : T —> [0,1]. Then the function f h : T x {-1, 1} -> [0, 1] defined by 
(x, e) 1— > f(x) + eh(x) gives a system that, when restricted to each copy of Z, is just P-f, 
but has correlations between the two copies that are given by h (the case h = gives the 
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independent joining). We can obtain slow decay of correlations between the two copies 
and negative associations in the joining itself; this is perhaps something that is not easy 
to construct directly. 



§2. Background. 

We first quickly review the probability measures studied in Lyons (|2002a|) ; see that 
paper for complete details. 

Let E be a finite or countable set and consider the complex Hilbert space £ 2 {E). Given 
any closed subspace H C £ 2 (E), let Ph denote the orthogonal projection onto H. There 
is a unique probability measure P H on 2 E ■= {0, 1} E defined by 

P H [v(ei) = l,...,7/(e fc ) = 1] =det[(P H e i ,e i )]i< <ii < fc (2.1) 

for all k > 1 and any set of distinct ei, . . . , e& G E; see, e.g., Lyons ( |2002aj) or Daley and 
Vere- Jones ( |1988|) , Exercises 5.4.7-5.4.8. On the right-hand side, we are identifying each 
e G E with the element of £ 2 (E) that is 1 in coordinate e and elsewhere. In case H 
is finite-dimensional, then P H is concentrated on subsets of E of cardinality equal to the 
dimension of H. 

More generally, let Q be a positive contraction, meaning that Q is a self-adjoint 
operator on £ 2 (E) such that for all u G £ 2 (E), we have < (Qu,u) < (u,u). There is a 
unique probability measure such that 

P Q [q{e 1 ) = l,..., V (e k ) = 1] = det[(Qe i} e^)]*^ (2.2) 

for all k > 1 and distinct ei,...,efe G E. When Q is the orthogonal projection onto a 
closed subspace H, then P^ = P H . In fact, properties of P^ can be deduced from the 
special case of orthogonal projections. Since this will be useful for our analysis, we review 
this reduction procedure. 



Note first that uniqueness follows from the fact that |(2.2)| determines all finite-dimen- 
sional marginals via the inclusion-exclusion theorem. Indeed, we have the following formula 
for any disjoint pair of finite sets A,B C E (see, e.g., Lyons ( |2002a|) ): 

P Q [r]\A = 1, r]\B = 0] = det [(l B (e)e + (-l) ls(e) Qe, e')l . (2.3) 

l Je,e'eAUB 

To show existence, let Pjj be any orthogonal projection that is a dilation of Q, i.e., if is a 
closed subspace of £ 2 (E') for some E' D E and for all u G £ 2 (E), we have Qu = Pp^P H u, 
where we regard £ 2 (E') = £ 2 (E)(B£ 2 (E'\E). (In this case, Q is also called the compression 
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of Ph to £ 2 (E).) The existence of a dilation is standard and is easily constructed; see, e.g., 
Lyons ( |2002a| ) . Having chosen a dilation, we simply define as the law of 77 restricted 
to E when r\ has the law P^. Then (2.2) is a special case of (2.1) 



A probability measure P on 2 E is said to have negative associations if for all pairs A 
and B of increasing events that are measurable with respect to the values of r\ on disjoint 
subsets of E, we have that A and B are negatively correlated with respect to P. The 
following conditional negative association (CNA) property is proved in Lyons ( 2UU2a| ) 
and a consequence of this (see Proposition will be used frequently here. 



Theorem 2.1. If Q is any positive contraction on £ 2 (E), A is a finite subset of E, and 
i]q E 2 a , then P^[ • | r)\A = rj ] has negative associations. 

We now assume that E = Z d . Then the group structure of Z d allows Z d to act 
naturally on £ 2 (Z* d ) and on 2 Z . The proof of the following lemma is straightforward and 
therefore skipped. 

LEMMA 2.2. If Q is a 7L d -invariant positive contraction on £ 2 (Z d ), then P^ is also Z d - 
invariant. 

As is well known, there exists a complex Hilbert-space isomorphism between L 2 (T d , A^) 
and £ 2 {Z d ) where T d is the (i-dimensional torus IR d /Z d and Xd is unit Lebesgue measure on 
T d . This isomorphism is given by the Fourier transform / 1— > /, where for / G L 2 (T d , Xd), 
we have f(k) = J Jd f(x)e~ 2irik ' x dXd(x) for k G Z d . If e k denotes the function x 1— > e 27 " fc ' x , 
then the isomorphism takes the set {e^ ; k G Z d } to the standard basis for £ 2 (Z d ). From 
now on, we shall abbreviate L 2 (T d : X d ) by L 2 (T d ). 

The following is well known. 

Theorem 2.3. 

(i) Let A C T d be measurable and consider the operator Ta '■ L 2 (T d ) — > L 2 (T d ) given by 

Ta(9) = 91a- 

Then these projections (as A varies over the measurable subsets ofT d ) correspond 
(via the Fourier isomorphism) to the 1i d -invariant projections on £ 2 (Z d ). 

(ii) More generally, let f : T d — > [0, 1] be measurable and consider the operator Mf : 
L 2 (T d ) -> L 2 (T d ) given by 

M f (g) = fg. 

Then these positive contractions (as f varies over the measurable functions from 
T d to [0, 1]) correspond (via the Fourier isomorphism) to the Z d -invariant positive 
contractions on £ 2 (Z d ). More specifically, Mf corresponds to convolution with f. 
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As in the above theorem, an / : T d — > [0, 1] yields a Z d -invariant positive contraction 
Q f on £ 2 (Z d ), which in turn yields a translation-invariant probability measure on 2 Z 
that we denote more simply by . 

Lemma 2.4. Given f : T d — * [0, 1] measurable and ei, . . . , e*. G Z d ? 

p/[ r? (e 1 ) = 1, . . . , 77(e fc ) = 1] = det[/(ej - e<)]i<i,j<A. • 
Proof. By definition, the left-hand side is det[(Q/ej, ej)]i<ij<fc. By Theorem |2~3|(ii), 

(Qfe u ej) = (M/e ei , e ej ) = f(ej - e*) . I 



Remark 2.5. Lemma 2.4 says that for d = 1, the probability of having Is on some finite 



collection of elements of Z is a particular minor of the Toeplitz matrix associated to /. 



Equation |(2.3)| shows a symmetry of P-^ and P 1 f , namely, if rj has the distribution 



P-^, then 1 — r\ has the distribution P 1- A Shirai and Takahashi ( |2003| ) prove the existence 
of P-^ by a different method and also note this symmetry. 

Although the last lemma gives us a formula for P-^ directly in terms of / without 
reference to any projections, it is still useful to know a specific projection of which Mf is 
a compression. Let / : T d -> [0, 1] be measurable. Identifying T d+1 with T d x [0, 1), we let 
Af C T d+1 be the set {(x, y) G T d+1 ; y < f(x)}. Consider the projection T Af of L 2 (T d+1 ) 
given in Theorem |2.3| (i). We view L 2 {T d ) as a subspace of L 2 (T d+1 ) by identifying g G 
L 2 (T d ) with g ® 1 G L 2 (T d+1 ), where (g ® l)(x,y) := g(x) for x G T d , y G T. The 
orthogonal projection P of L 2 (T d+1 ) onto L 2 (Y d ) is then given by g \— > (x i— ► J T gr(x, y) dy). 
A very simple calculation, left to the reader, shows that Mf is a compression of Ta s \ i-e., 

M f = PT Af (2.4) 

on L 2 (T d ) viewed CIS 3; subspace of L 2 (T d+1 ). For later use, let 

H f := {(7 G L 2 (T d+1 ) ; # = a.e. on (A f ) c } 

be the image of T Af ■ 

We next remind the reader of the notion of stochastic domination between two prob- 
ability measures on 2 E . First, if rj, 5 are elements of 2 E , we write r\ 5 if 77(e) < 5(e) for 
all e G -E 1 . A subset A of 2 s is called increasing if 77 G A and rj =4 5 imply that 5 G A. If z/ 
and /j are two probability measures on 2 E , we write =<; ^ if i'(A) < fx(A) for all increasing 
sets A. A theorem of Strassen (|1965|) says that this is equivalent to the existence of a 
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probability measure m on 2 E x 2 E that has v and /i as its first and second marginals (i.e., 
m is a coupling of v and /i) and such that m is concentrated on the set {(77, S) : r\ =^ 5} 
(i.e., m is monotone). 

Throughout the paper, we shall use the following consequence of conditional negative 
association (Theorem sometimes called joint negative regression dependence 

(see Pemantle ( POOCD ). 

Proposition 2.6. Assume that {Xi}i e i has conditional negative association, I is the dis- 
joint union of I\ and I2, and a, b G {0, with ai < bi for each % G I\. Then 

[{Xi} iel2 \Xi = bi,ie h) =4 [{Xi} ieh \Xi = ai,ie h] , 

where [Y \ A] stands for the law ofY conditional on A. 

Lemma 2.7. Let f u f 2 : T d -> [0, 1] with fx < f 2 a.e. Then P /x ^ P /2 . 

Proof. This follows from a more general result (see Lyons ( p002a|) ) that says that if Qi 
and Q2 are two commuting positive contractions on £ 2 (E) such that Q± < Q 2 in the sense 
that Q2 — Qi is positive, then P^ 1 ^ P^ 2 . However, here is a more concrete proof in our 
case. Since fi < f 2 a.e., it follows that Hf 1 C Hf 2 , which implies by Theorem 6.2 in Lyons 
( p002a|) that the projection measures P H fi and P H f2 on Z d+1 satisfy P H /i 4 P H ^ and 



hence their restrictions to Z d satisfy the same relationship; i.e., P-^ 1 ^ P-^ 2 . I 

We close this section with a key representation of certain conditional probabilities and 
an application. The minimum in |(2.5)| below is often referred to as a Szego infimum. For 
an infinite set B C Z d \ {0}, write 

P / [r/(0) = l\ r]\B = l} := lim P / [?/(0) = 1 | r)\B n = 1] , 

n — >oo 

where B n is any increasing sequence of finite subsets of B whose union is B. This is a 



decreasing limit by virtue of Proposition Let (■, ■)/ denote the usual inner product in 



the complex Hilbert space L 2 (f). For any set B C Z d , write [B]f for the closure in L 2 (f) 
of the linear span of the complex exponentials {e^ ; k G B}. 

Theorem 2.8. Let f : T d -> [0, 1] be measurable and B c Z d with ^ B. Then 



P f [n(0) = 1 I rj\B = 1] =min|y \l-u\ 2 fd\ d ; u G [B] f \ 



(2.5) 



Proof. It suffices to prove the theorem for B finite since the infinite case then follows by a 
simple limiting argument. So assume that B is finite. Note that f(k — j) = (e J5 e/j)j, so 
that 

P f [r f \B = l] = det[(e j , e k ) f ) j>keB , 
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and similarly for [f]\(B U {0}) = 1]. Thus, the left-hand side of (2.5) is a quotient 



of determinants. The fact that such a quotient has the form of the right-hand side is 
sometimes called Gram's formula. We include the proof for the convenience of the reader. 
Since eo = 1, it follows by row operations on the matrix [(e,-, Gk)]j } keBu{o} that 

P^[r 7 (0) = l|r 7 rS = l] = ||P [ i ]/ l|| 2 / , (2.6) 

where denotes orthogonal projection onto the orthogonal complement of [B]f in 

L 2 (f). Since this is the squared distance from 1 to [B]f, the equation |(2.5)| now follows. I 



An extension of the above reasoning, given in Lyons ( |2UU2a ), provides the entire 
conditional probability measure: 

Theorem 2.9. Let f : T d -> [0, 1] be measurable and B c Z d . Then the law ofn\(Z d \ B) 
conditioned on n\B = 1 is the determinantal probability measure corresponding to the 
positive contraction on £ 2 (Z d \ B) whose (j, k) -matrix entry is 



( p [i] f e h p [i} f e k) r 



forj,k <£ B. 



The Szego infimum that appears in Theorem |2 . 8| involves trigonometric approximation, 
a classical area that has strong connections to the topics of prediction and interpolation 
for wide-sense stationary processes. We briefly discuss these topics now. In later sections, 
we describe more explicit connections to our results. Recall that a mean-0 wide-sense 
stationary process is a (not necessarily stationary) process {Y n ) neZ d for which all the 
variables have finite variance and mean 0, and such that for each fc G Z d , the covariance 
Cov(Y n +k, Y n ) = ¥i\Y n+ kY n ] does not depend on n. There is then a positive measure G 
(called the spectral measure) on T d satisfying 

G(k) = Cov(Y n+k ,Y n ) 

for n,k G It turns out that for a one-dimensional wide-sense stationary process, if 
G is absolutely continuous with density g, then GM(g) = iff perfect linear prediction is 
possible, which means that Yq is in the closed linear span of {y n } n <_i in L 2 (0), where 
O is the underlying probability space. This was proved in various versions by Szego, 
Kolmogorov and Krein. Since the covariance with respect to P-^ of 77(0) and rj(k) for 
k G Z d is also given by a Fourier coefficient (namely — \f(k)\ 2 for k 7^ 0) and since, as we 
shall see, the geometric mean of / will play an important role in classifying the behavior of 
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P-' as well, one might wonder about the relationship between our determinantal processes 
and wide-sense stationary processes. It is not hard to show that P-^ (viewed as a wide-sense 
stationary process) has a spectral measure G that is absolutely continuous with density g 
given by the formula 

g:=f(0)l-f*f, 

where * denotes convolution and f{t) := f(l — t). Other than the trivial cases / = 
and / = 1, it is easy to check that g is bounded away from and so, in particular, its 
geometric mean is always strictly positive. This suggests that our results are perhaps not so 
connected to prediction and interpolation. However, it turns out that some of the questions 
that we deal with here (such as phase multiplicity and domination) concerning P-^ do have 
interpretations in terms of prediction and interpolation for wide-sense stationary processes 
whose spectral density is / (which does not include P-^). More specifics will be given in 
the relevant sections. 

Special attention is often devoted to stationary Gaussian processes, one reason being 
that their distribution is determined entirely by their spectral measure. It is known that a 
stationary Gaussian process with no deterministic component is a multistep Markov chain 
iff its spectral density is the reciprocal of a trigonometric polynomial; see Doob (|1944|) . 
The analogous property for determinantal processes is regeneration: 

Proposition 2.10. If d = 1, then f is the reciprocal of a trigonometric polynomial of 
degree at most n iff P-^ is a regenerative process that regenerates after n successive Is 
appear. 

This last property means that for any k, given that rj\[k + 1, k + n] = 1, the future, 
T)\[k + n + 1, oo), is conditionally independent of the past, n\(—oo,k]. 

Proof. Note first that because of Theorem [H| this regenerative property holds for P-^ iff 
for all j > and all C C (— oo, —n — 1], we have ej = -P[ij u c] / e j, where B := [— n, —1]. 
(Here, we are relying on the fact that \\Pfjj] e j\\f > ll^[Bu(7|/ e ill/ ^ ^ ne vec tors are not 
equal.) This is the same as ej _L [C]f for all j > 0, or, in other words, there 
exists some Uj G [B]f with — Uj _L [B U C]f. As this would have to hold for all C 
(and Uj is independent of C), it is the same as the existence of some Uj G [B]f such 
that Tj := (e^ — Uj)f is analytic, i.e., Tj(k) = for all k < 0. Now this implies that 
T = (1 — wo)/, whence T (l — Uq) = T (l — uo). The left side of this last equation is 
an analytic function, while the right side is the conjugate of an analytic function (i.e., its 
Fourier coefficients vanish on Z + ). Therefore, both equal some constant, c. Hence 

, = T = c = c 

1-Uq (1 -« )(1 -«o) 1 1 — Zi 1 2 
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which is indeed the reciprocal of a trigonometric polynomial of degree at most n. 

Conversely, if 1/ f is the reciprocal of a trigonometric polynomial of degree at most 
n, then since / > 0, the theorem of Fejer and Riesz (see Grenander and Szego (|1984|) , 
p. 20) allows us to write / = c/|l — uo\ 2 for some conjugate-analytic polynomial uq G [£?]/ 
such that the analytic extension of 1 — uq to the unit disc has no zeroes. We may rewrite 
this as (1 — uo)f = Tq for To := c/(l — uq). Since To has an extension to the unit 
disc as the reciprocal of an analytic polynomial with no zeroes, it follows that To is also 
analytic. Multiplying both sides of this equation by ei and rewriting exuof = ci(l— it^)/ = 
c\{uq — m'i)/ + ciT) for some constant c\ and some u\ G [B]f, we see that (ei — u{)f = T\ 
for u\ := C\{uq — u^) G [£?]/ and T\ := eiT + C\Tq, an analytic function. Similarly, 
we may establish by induction that for each j > 0, there is some Uj G [B]f such that 
Tj := (ej — Uj)f is analytic. This proves the equivalence desired. I 

We now use this proof to establish the explicit probabilistic form of the renewal process 
in Example |1.7| . In this case, B = {—1} and one easily verifies that Uj = a? + e_i, as is 
standard in the theory of linear prediction. Therefore, 

p f m = 1 1 rt-i) = i] = n^^iiu = hi-^ff = - = -« 2j+2 ) • 

It now suffices to verify that this is also true for the explicit renewal process described in 
Example |1.7| . First, it is well known from basic renewal theory that for a renewal process, 
the probabilities [rj(j) = 1 | rj(— 1) = 1] determine the distribution of the number of 
0s between two Is. Hence to verify the above statement, one simply needs to check that 
these latter probabilities are related to the interrenewal distribution via the appropriate 
convolution-type equation. In this case, it comes down to verifying that for all j > 0, 

1^(1 - a 2 ^ 2 ) = (j + !)(!- a) V + £ Jfe(l - afa k ~\l - a 2 ^ fc )+ 2 ) . 

1 + a 1 + a j-^ 1 



This identity is easy to check. 
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§3. The Bernoulli Shift Property. 

In this section, we prove that the stationary determinantal processes studied here are 
Bernoulli shifts. We assume the reader is familiar with the basic notions of a Bernoulli 
shift (see, e.g., Ornstein Q1974Q ). 



Theorem 3.1. Let f : T d — ► [0, 1] be measurable. Then P-^ is a Bernoulli shift; i.e., it is 
isomorphic (in the sense of ergodic theory) to an i.i.d. process. 

Before beginning the proof, we present a few preliminaries. We first recall the defini- 
tion of the (i-metric. 

Definition 3.2. If [i and v are Z d -invariant probability measures on 2 Z , then 
d(n,v) := infm [{(??, 5) G 2 Z " x 2 Z< * ; 77(0) ^ 5(0)} 

ml . 

where the infimum is taken over all couplings mof/i and v that are Z d -invariant. 

The following is a slight generalization of a well-known result (see, for example, page 75 
in Liggett (|1985| ) ) . The proof is an immediate consequence of the existence of a monotone 



coupling and is left to the reader. 

Lemma 3.3. Suppose that o~\, o~i, v and \i are 7L d -invariant probability measures on 2 
such that v ^ o~i =^ \i for both i = 1,2. Then 

d{a 1 ,a 2 )<f J ,[ V {0) = l]-u[r } {0) = l]. 

Proposition 3.4. Let f, g : T d — > [0, 1] be measurable. Then 

d(pf,I>°)< [ \f-g\d\ d . 

Proof. Because of Lemma ^.7| , we may apply Lemma |3T3] to o\ := P-^, o"2 := P s , v := P-^ Aff , 
and fj, := P fWg . We obtain that 

d(pf,P°)< [ fVgdXi- f fAgd\ d = [ \f-g\d\ d . I 

JT d JT d JT d 



Proof of Theorem \3. 1\ . The first step is to approximate / by trigonometric polynomials. 
Let K r be the rth Fejer kernel for T, 
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and define Kf{x\, . . . , Xd) '■= Y\^=i K r (xi). It is well known that Kf is a positive summa- 
bility kernel, so that if we define g r by 

g r :=f*K?, 

then < g r < 1 and lim r _». 00 g r = f a.e. and in L 1 (T d ). 

Next, since each Kf is a trigonometric polynomial, so is each g r . From this and 



Lemma |2^4|, it is easy to see that there is a constant C such that P 9r (Af]B) = P 5r (A)~P 9r (B) 
if A is of the form r\ = 1 on Si and B is of the form r\ = 1 on S2 with Si and S2 being 
finite sets and having distance at least C between them. From this, a standard argument in 
probability (a n-X argument) shows that if A is any event depending only on locations Si 
and if B is any event depending only on locations £2 with Si and £2 possibly infinite sets 
having distance at least C between them, then ~P 9r (A D B) = Y >9r (A)Y >9r (B). A process 
with this property is called a finitely dependent process and this property implies it is 
a Bernoulli shift (e.g., the so-called very weak Bernoulli condition is immediately verified; 
see Ornstein ( |1974j ) for this definition for d = 1 and, e.g., Steif ( |1991| ) for general d). 

Since the processes that are Bernoulli shifts are closed in the d metric (see Ornstein 



( P-974D ) and Proposition pT4] tells us that linv^oo d(P 9r , P J ) = 0, we conclude that I* f is 



a Bernoulli shift. I 

Remark 3.5. An important property of 1-dimensional stationary processes is the weak 
Bernoulli (WB) property. This is also referred to as mixing" and "absolute regularity" 
in the literature. Despite its name, it is known that WB is strictly stronger than Bernoul- 
licity. It is easy to check that "aperiodic" regenerative processes and finitely dependent 
processes are WB. Hence, by our earlier results, if / is a trigonometric polynomial or the 
inverse of a trigonometric polynomial (or if 1 — / is the inverse of a trigonometric polyno- 
mial), then P-^ is WB. This is subsumed by the independent work of Shirai and Takahashi 
( P003|) , who showed that P-^ is WB whenever J^ n>1 n\f(n)\ 2 < 00. The precise class of / 
for which P-^ is WB is not known. We also note that it follows from Smorodinsky ( 1992Q 



that if / is a trigonometric polynomial, then P-^ is finitarily isomorphic to an i.i.d. process. 
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§4. Support of the Measures. 

In this section, we show that all of the probability measures P-^ have full support 
except in two degenerate cases. We begin with the following lemma. 

Lemma 4.1. Let H be a closed subspace of £ 2 (E) and let A and B be finite disjoint subsets 
ofE. Then 

P H [n(e) = 1 for e e A, 77(e) = for e e B] > 
if and only if {PH(e)} ee A U {Pjj±(e)} ee B is linearly independent. 

Proof. In the special case that A or B is empty, the result follows from Lyons ( |2002a|) . In 
general, {PH(e)} ee A U {P#^ ( e )}ee b is linearly independent if and only if {Ph (e)} e eA and 
{P H ±(e)} ee B are each linearly independent sets. Also, P H [q\A = 1, n\B = 0] > if and 
only if P H [n\A = 1] > and P H [r]\B = 0] > since 

P H [n\A = 1, n\B = 0] > P H [n\A = 1]P H [r)\B = 0] 



by the negative association property, Theorem 2.1. Thus, the general case follows from 



the special case. I 

THEOREM 4.2. P-^ has full support for any function f : T d — » [0, 1] other than or 1. 

Proof. Since marginals of probability measures with full support clearly have full support, 
it suffices to prove the result for P H when H is a closed Z d -invariant subspace of £ 2 (Z d ) 
other than £ 2 (Z d ) or 0. If we translate over to L 2 (T d ), we see, by Lemma |4.1| , that it 
suffices to show that if A C T d with < Xd(A) < 1 and n\, . . . , rik, mi, . . . , nig are all 
distinct elements of Z d , then 

{e nj lA}i<j<fc U {e mr l J 4c} 1 < r <^ 

is linearly independent. Suppose that ci, . . . , c^, d±, . . . , di are complex numbers such that 

fc i 
y^Cj-e^.l^ + ^2d r e mr l A c = 0. 

j=l r=l 

Prom this it follows that Y^ , j=i c j e n j = a.e. on A. Since Xd(A) > 0, we have that 
^2j=i c j e nj is on a set of positive measure. It is well known that this implies that 
ci, . . . , Ck vanish (the proof uses induction on d and Fubini's theorem). Similarly, d±, . . . , di 
vanish. 1 
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5. Domination Properties. 



In this section, we study the question of which product measures are stochastically 
dominated by P-^ and which product measures stochastically dominate PA For simplicity, 
we give our first results for d = 1, and only afterwards describe how these results extend 
to higher dimensions. We also treat a different notion of "full domination" at the end of 
the section. Recall that for / : T d — > [0, 1], we define 



We introduce an auxiliary stronger notion of domination than but only in the case 
where one of the measures is a product measure. Let \i v denote product measure with 
density p. 

Definition 5.1. A stationary process {rj n } n!E z with distribution v strongly dominates 
/Up, written \i p ^ s u, if for any n and any a±, . . . ,a n G {0, 1}, 



Similarly, we define v ^ s \x v if the above inequality holds when > is replaced by <. 

The following lemma is easy and well known; it is sometimes referred to as Honey's 
lemma. 

Lemma 5.2. If \x v =<; s v, then \x v =^ v. 



The optimal improvement of these stochastic bounds is as follows. 

Theorem 5.3. For any / : T — > [0,1], we have \i p ^ P / iff p < GM(/) iff p p ^ s P / . 
Similarly, =4 p q iff q > 1 — GM(1 — /) iffPf =4 S fJ>q- In addition, for any stationary 
process p that has conditional negative association, we have p p =4 (i iff p p =4 S p. 

Proof. Let d n := D n -i(f) be the probability of having n Is in a row. According to Szego's 
theorem (Grenander and Szego ( |1984p , pp. 44, 66), d n+ i/d n is decreasing in n and 




P[?7o = 1 \ rji = a,i,i = 1, . . . , n] > p . 



The converse is not true, as we shall see later (Remark |6.4[ ). In light of Lemma [277 



we have that if p < f < q, then 



lim d n+ i/d n = GM(/) = lim d]( n . 



n 



In particular, dn+i/dn > GM(/) for all n. 
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Proposition |2.6| implies that for any fixed n, 



P f [r) Q = 1 | j]i = a»,i = l,...,n] 

is minimized among all oi, . . . , a n G {0, 1} when a,\ = 0,% = • • • = a n = 1. In this case, 
the value is d n+ i/d n . Since this is at least GM(/), we deduce that if p < GM(/), then 
\x v pf and hence that \i v ^ P? . 

Conversely, if \i v =<! P-^, then certainly p n < d n for all n. Hence p < GM(/). The 
second to last statement can be proved in the same way or can be concluded by symmetry 
Finally, the last statement can be proved in a similar fashion. I 

Remark 5.4. The above theorem gives us two interesting examples. First, if we take 
/ := 1a where A has Lebesgue measure 1 — e, then d(P\5±) < e by Proposition |3.4| , 
but nonetheless, P-^ does not dominate any nontrivial product measure since GM(/) = 0. 
Second, if we take / := l[o,i/2] + -4 • l[i/2,i]> then / < 1/2 on a set of positive measure, 
but nonetheless Pf dominates P 1/2 = fi 1 / 2 since GM(/) = (.4) 1 / 2 > 1/2. 

Example 5.5. Let / be as in Example |1.2j , so that P-^ is the law of the horizontal edges 
of the uniform spanning tree in the plane. In order to examine a 1-dimensional process, 
let us consider only the edges lying on the x-axis. If we let 

9(b) '■= / f(x,y)d\i(y) , 
Jt 

then the edges lying on the x-axis have the law P 3 since 'g(k) = f(k, 0) for all k G Z. Since 
an antiderivative of 1/(1 + a sin 2 ity) is 

arctan (a/1 + atan7ry) 
71VI + a 

we have 

sin ttx 



+ sin 2 ttx 

as given in Example |1.3| . We claim that P 9 strongly dominates fi p for p := \/2 — 1, 
and this is optimal. In order to show this, we calculate GM(g). Write g±(x) := sin 2 7rx. 
Then (GM((?)) 2 = GM((7i)/GM(l + g{). Let G± be the harmonic extension of log g± from 
the circle to the unit disc. Then J loggidAi = Gi(0) by the mean value property of 
harmonic functions. Since gi = |(1 — ei)/2| 2 , we see that Gi is the real part of the 
analytic function z 1— > 21og[(l — z)/2] in the unit disc, from which we conclude that 
Gi(0) = log(l/4). Therefore, GM( gi ) = 1/4. Similarly, l + gi = | [^2 + 1 - (V2 - l)ei]/2| 2 , 
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whence GM(1 + Qi) = [(V^ + l)/2] 2 . Therefore GM(g) = y/2 - 1 = 0.4142+, as desired. It 
turns out that q := 1 - GM(l - g) = 1 - 2( v / 2 - l)e" 2G /^ = 0.5376+, where 



G:=V-A — ^=0.9160" (5.1) 

is Catalan's constant. Thus, P 9 It is interesting how close p and q are. 

We now introduce some new mixing conditions. Given a positive integer r, we may 
restrict P-^ to 2 rZ . If we identify rZ with Z, then we obtain the process P^ r , where 



r 



where r _1 t:={xGT; rx = i}. The reason that this restriction is equal to P^ r is that 
for all k G Z, we have / r (fc) = f(rk), as is easy to check. Because of this relation, we 
have that J \f r (t) — f(0)\ 2 d\i(t) -> as r -» oo, so that by Lemma |3l|, it follows that 
d(P^ r , P^(°)) — > as r — > oo. (It is not hard to show that a similar property holds for any 
Kolmogorov automorphism (see Definition pM|) , while the example in Remark [T4] shows 
that this property can occur in other cases as well.) In fact, we often have a stronger 
convergence for our determinantal processes, as we show next. 

Theorem 5.6. Let f : T — > [0, 1] be measurable. If GM(/) > or f is bounded away 
from on an interval of positive length, then there exist constants p r — > /(0) such that 
p/r ppr y or a ^ r _ Therefore, i/GM(/(l — /)) > or if f is continuous and not equal 
to nor 1, £/ien there exist constants p r ,q r — > /(0) swc/i t/iat P Pr ^ P-^ r ^ P 9r /or all r. 



Of course, in view of Theorem |5.3| , we use p r :— GM(f r ) and q r := 1 — GM(1 — / r ). 
Thus, the theorem is an immediate consequence of the following lemma. 

Lemma 5.7. Let f : T — > [0, 1] 6e measurable. If GM(/) > or / is bounded away from 
on an interval of positive length, then GM(f r ) — > /(0) as r ^ oo. 

Proof. We have seen that / r — ► /(0) • 1 in L 2 , whence also in measure. Therefore log / r — ► 
(log /(0)) • 1 in measure. Thus, it remains to show that {log f r } is uniformly integrable (at 
least, for all large r). Suppose first that GM(/) > 0. 

Given h G L 1 (T), write h {r \t) := ft(rt), so that hi r ){k) is when k is not a multiple 
of r and is h(k/r) when k is a multiple of r. For any g, /i G L 2 (T), we have 



fcez fcez fcez 



g(t)h(r)(t)dX 1 (t) = / g(t)h(rt)dX 1 (t). (5.2) 
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Therefore, for any set A C T, we have 

/ log(f r )d\ 1 > [ (logf) r d\ 1 = [ log fdXf, 

JA JA Jr~ 1 A 

the inequality follows by concavity of log, while the equality follows from |(5.2)| applied to 
g := log / and h := 1 A - Given any e G (0, /(0)), let := {t ; f r (t) < e}. Note that for 
any measurable A C T, we have Ai(r _1 A) = Ai(A). Since / r — >■ /(0) • 1 in measure, it 
follows that 

lim Ai(r -1 ^) = lim Ai(^) =0. 



i — >oo 



Since log / is integrable, it follows that 



lim / log / d\i = . 

This establishes the uniform integrability in the first case. 

In the second case where / > c > on an interval of length e > 0, we have that 
f r > ce on all of T for all r > 1/e. It is then obvious that {log/ r } is uniformly integrable 
for all r > 1/e. I 

Remark 5.8. There are functions / with / > a.e., yet GM(/ r ) = for all r. For example, 
enumerate the rationals {xj ; j > 1} in (0, 1) and choose 6j > such that Xj + ej < 1 and 
Y^j £j < 1- Define A n := [0, 1] \ [Jj >n [xj, Xj + £j] and fo(x) := e _1//x . Then one can show 
that 

oo 

f(x) := f (x)l Ao (x) + fo(x - x n )l [Xn , Xn+£n]nA Jx) 



n = l 



is such an example. 



We turn next to the extension of the prior results to higher dimensions. This is 
basically straightforward, but has interesting applications, as we shall see. First, we recall 
the usual notion of the past a-field Past(/c) at a point k 6 Z d . We use lexicographic 
ordering on Z d , i.e., write (k\, &2, • • • , kd) -< (h,h, ■ ■ ■ ,ld) if K < U when % is the smallest 
index such that ki ^ U. Then define Past(fc) to be the cr-field generated by rj(j) for all 
j -< k. More generally, the past a-field could be defined with respect to any ordering of 
Z d , which means the selection of a set IT C Z d that has the properties II U (— II) = Z d \{0}, 
IT fl (—11) = 0, and IT + IT C IT. The associated ordering is that where k -< I IS I — k G II. 
Pastn(w) will denote the past of u with respect to the ordering IT, i.e., {v ; v -< u}. Thus, 
Pastn(O) is just —II. (For a characterization of all orders, see Teh ( |1961|) , Zaiceva ( |1953|) , 



or Trevisan ( |1953|) .) As before, let \i v denote product measure with density p. 
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Definition 5.9. Given an ordering EE, a stationary process {rjn} n ei d with distribution v 
strongly dominates fj, p , written \x p u, if 

P[t7o = 1 | Pastn(O)] > p i/-a.s. 

Similarly, we define v =^ s \i p if the above inequality holds when > is replaced by <. (Note 
that the ordering IT here is suppressed in the notation.) 

Although we have phrased it differently, in one dimension, this is equivalent to Defi- 
nition pTTI when IT = {—1, —2, . . .}. Again, we have a version of Holley's lemma: 

Lemma 5.10. Given any ordering II, if fj, p ^ s v, then n p =^ v. 

(Note that to produce a monotone coupling for a general ordering with respect to a 
set IT, it is enough to do so for the measures restricted to any finite B C Z d . Given such a 
B, order B by the restriction of -< to B and couple the measures by adding sites from B 
in this order.) 

We now prove 

Theorem 5.11. Fix any ordering IT. For any measurable f : T d — > [0, 1], we have fi p =4 
iffp < GM(/) ifffi p 4s P f Similarly, pf 4 V q iff q > 1 - GM(1 - /) iffPf ^ s fi q . In 
addition, for any stationary process \i that has conditional negative association, we have 
Hp =4 H iff Hp H- 

Proof. Proposition |2.6| and Theorem |2.8| imply that 

essinf P / 77(0) = 1 | Past n (0) = inf |p / [ry(0) = 1 | rj\A= 1] ; A C -IT is finite} 

= p / [r 7 (o) = i|7 7 r(-n) = i] 

= min|y \1-T\ 2 fd\ d ; T G [-%} . 

By Helson and Lowdenslager ( |1958| ), the latter quantity equals GM(/). Therefore, p < 
GM(/) iff Hp ^ s P^. On the other hand, if n p ^ P^, then let A(r) be the finite subset of 
—IT consisting of all points within some large radius r about 0. Let a± -< 02 -< • • • -< a n be 
the elements of A(r) in order, where n :— \ A(r)\, and write Aj := A(r) n (—11 + a,j). Then 

n 

P n < P f [n\A(r) = 1] = n p/ ^( fl j) = 1 I V\ A i = !] • 

3=1 

Most terms in this product are quite close to GM(/), while all lie in [GM(/), 1]. It follows 
that 

lim P f [n\A(r) = l] 1 /™ = GM(/) , 

r — >oo 

so that p < GM(/). Finally, the remaining statements can be proved in a similar fashion. 

I 
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Remark 5.12. It follows from Theorem |5.11| that the choice of ordering IT does not de- 



termine whether \i v ^ s P^. This is not true for general stationary processes, even in one 
dimension. We are grateful to Olle Haggstrom for the following simple example. Let \i be 
the distribution on {0, l} 2 given by 

fi = (l/7)(S m + 5 ( o,i)) + (2/7)«y (lj0) + (3/7)<5 (M) . 

Let r] G {0, 1} Z be such that with probability 1/2, (r)2 n , Vin+i) are chosen independently 
each with distribution \x and otherwise {r]2n-i,V2n) are chosen independently each with 
distribution \x. Let v be the law of r\. If IT := Z~, then \x v v iff p < 1/2, while if 
n := Z+, then (jt p ^ s v iff p < 4/7. 

Example 5.13. Let / be as in Example |1.2| , so that P-^ is the law of the horizontal edges of 
a uniform spanning tree in the square lattice Z 2 . By Kasteleyn ( |1961|) or Montroll (|1964j) , 
we have 

f 4G 

/ log4(sin 2 7rx + sin 2 7n/)dA 2 (x,y) = — = 1.1662+, (5.3) 
where G is again Catalan's constant |(5.1)| . As we have shown in Example |5.5| , GM(sin 2 nx) = 



1/4, whence by IfOH GM(/) = e" 40 ^ = GM(1 - /), where we are using the observation 
that 1 — f(x,y) = f(y,x). (This last identity has a combinatorial reason arising from 
planar dual trees.) Therefore \x v ^ P-^ ^ ^\-p for p := e -40 /^ = 0.3115+ and this p is 
optimal (on each side) . This result in itself is rather surprising and it would be fascinating 
to see an explicit monotone coupling. 

For / : T d — > [0, 1] measurable and r = (ri, r%, . . . , rj) G Z d with all rj > 0, define 

/-(*):= =5^- E A*)' 

llj = l r j xEr- 1 t 

where r _1 (ti, t2, ■ ■ ■ , td) '■= {(xx,X2, ■ ■ • , Xd) G T d ; Wj TjXj = tj}. Write r — > oo to mean 



that minrj — > oo. The following is a straightforward extension of Theorem |5.6| . We leave 
its proof to the reader. 

Theorem 5.14. Let f : T d — > [0,1] be measurable. If GM(/) > or f is positive on a 
non-empty open set, then there exist constants p r — » /(0) as r — > oo such that P-^ r !>= P Pr 
for all r. Therefore, i/GM(/(l — /)) > or if f is continuous and not equal to nor 1, 
then there exist constants p r , q r — > /(0) such that P Pr =^ P-^ =^ P 9r for all r. 

Consider now a domination property even stronger than our previously defined strong 
domination. Let Z be the a-field generated by n(k) for k ^ 0. 
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Definition 5.15. A stationary process {rj n } ne zd with distribution v fully dominates 
\i v , written \i v =4f v, if 

P[?7o = 1 I Z] > p v-a..s. 

In this situation, we also say that v is uniformly insertion tolerant at level p. Similarly, 
we define v =^f \i v if the above inequality holds when > is replaced by <, and say that v 
is uniformly deletion tolerant at level 1 — p. We say that v is uniformly insertion 
tolerant if \x v v for some p > and that v is uniformly deletion tolerant if v =^f fi p 

for some p < 1. 

We show that the optimal level of uniform insertion tolerance of a determinantal 
process P-^ is the harmonic mean of /, defined as 



HM(/) 



d\ 

i d f 



Note that HM(/) = iff 1/f is not integrable. 

Theorem 5.16. For any measurable f : T d — > [0,1], we have /j p 4{ P f iff p < HM(/). 
Similarly, P^ ^ f n q iff q > 1 - HM(1 - /). 



Proof. By Proposition |2.6| and Theorem [LQ, we have 
ess inf P-^ 



^(0) = 1 | Z = inf |p / [r ? (0) = 1 | n\A = 1] ; A C Z d \ {0} is finite } 

= P / [r ? (0) = 1 | = 1] = min |y |1 - T| 2 / dA d ; T G [B]/ J 



where B := Z \ {0}. As shown by Kolmogorov ( p. 941a , 1941 b| ) for d = 1, the latter equals 



HM(/). The proof extends immediately to general d. This proves the first assertion. The 
second follows by symmetry. I 

Remark 5.17. Since a proof of Kolmogorov's theorem that 

minjy \1-T\ 2 fd\ d ; T G [£?],} = HM(/) , 

where B \=7h d \ {0}, is difficult to find in readily accessible sources, we provide one here. 
We have 



nun jy \1-T\ 2 fd\ d ; T G [%} = ||u|| /s 



1 2 



where 

D-L 
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Now g J_ [B]f iff g e £ 2 (/) and = for all k e B. The latter condition holds iff fir/ 

is a constant. If g ^ 0, then we deduce that 1/ / G L 2 (f), i.e., HM(/) > 0. Therefore, 

± _ JO ifHM(/)=0, 
\C// ifHM(/)>0. 

Hence u = if HM(/) = 0, while otherwise, 



u 



HM(/)//)/| 2 = HM(/) 



as desired. 



Remark 5.18. It is easy to find / such that GM(/) > and HM(/) = 0. Indeed, a natural 
such example is the function g of Example |1.3| . For any such /, the corresponding process 
P-^ strongly dominates a nontrivial product measure, but does not fully dominate any 
nontrivial product measure. In addition, for any function / such that HM(/) > and / is 
not constant a.e., GM(/) > HM(/) (as a consequence of Jensen's inequality), so that P^ 
will strongly dominate strictly more product measures than it will fully dominate. 

Example 5.19. Let / be as in |(1.2)| , so that P-^ is the law of the edges of the uniform 
spanning forest in Z d that lie parallel to the xi-axis. Then P-^ is uniformly deletion tolerant 
iff d > 4. This is because 1/(1 — /) is integrable iff d > 4. For example, when d = 4, we 
obtain full domination by (jl p with p := 0. 66425", where we have calculated HM(1 — /) as 
follows. First, we have 

1/(1-/) = 1+ Sin27nEl 



E4 • 2 

=2 sm 7tx 3 



'3 

whence 

If 1 

!/(! - /)^4 = 1 + - / —4 rfA 3 (x2,X3,X 4 ) 

T 4 1 Jj 3 Z^j=2 Sm nx 3 



If 1 

1 + r / ; dX 2 (x2,X3, 



2 



J=2 Slli TTXj 



This last integral has no simple form and is calculated numerically. This gives the value 
reported for 1 — HM(1 — /). For large d, we have 1 — HM(1 — /) ~ 1/d. Indeed, write A^_ x 
for Lebesgue measure on Letting 

A d -= f , — \ rfA d _i(x 2 ,...,x d ), 

Jt 11 - 1 2^j =2 2sin ttxj 
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we have that 



(d-l)(l-HM(l-/)) 



A- 



Hence it suffices to show that lim^oo 

d 



A, 







l + A d /(d-l) ' 
1. To show this, note that 



dt 



< 



+ 



+ 



J=2 
10(rf-l) 



Xd-i I ^2 2 sin2 wx 3 < (d-i) A 

d 

A d _i f ^2 2 sin2 < (d-1) A 

=2 

d 

A d _i[^2sin 2 7rx J < (d- l)/2 



dt 



10(d-l) 



1[0,1](*) rft 



Vj < 1/2 and 2^ 8x j < X )A 

i=2 



1 



dt 



as d — ► oo, where we have used the weak law of large numbers and the bounded convergence 
theorem for the first piece, a standard large-deviation result for the second piece, and an 
easy estimate on the third piece. The reverse inequality obtains by using only the first piece. 
Thus, Ad — > 1, as desired. This value of 1 — HM(1 — /), which gives a full domination 
upper bound on P-^, should be compared to /(0) = 1/d (by symmetry), which is the 
probability of a 1 at a site. On the other hand, for no d is the process uniformly insertion 
tolerant since 1/ f is never integrable. We remark that for the full uniform spanning forest 
measure on Z d (considering edges in all directions), we have change intolerance, meaning 
that P[r/(0) = 1 | Z\ G {0, 1} a.s. This follows from a result of Benjamini, Lyons, Peres, 
and Schramm (2000), as explained by Heicklen and Lyons (|2002|) . 

Example 5.20. Let g(x) be as in Example |1.3| , so that the edges of the uniform spanning 
tree in the plane that lie on the x-axis have the law P 9 . We have P 9 =^f \x v for p : = 
(1 + 7r)/(l + 2tt) = 0. 56865+, and this is optimal. This is because 

/ 1/(1 — g)dX= / ( 1 + sin 2 tix + sin tix yl + sin 2 tix J d\(x) 

Jj 

An antiderivative of this integrand is 



1 

•— arctan 

7T 



COS TIX 



1 + sin tix 



— cos 7rx v 1 + sin tix . 
2ti 
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whence the remaining integral is 1/2 + 1/n. Therefore Ll/(1 — g) dX = 2 + l/n and 
1 — HM(1 — g) = (1 +7r)/(l + 27t), as desired. Observe that / 1/(7 = 00 and so the process 
is not uniformly insertion tolerant. Similarly, for the edges lying on the x-axis of the 
uniform spanning tree in 3 dimensions, we have full domination by \x v with p : = 0.37732" 1 ". 
Here, the law of these edges is P , where 

h{x) := f(x,y,z)dX 2 {y,z) = - Sm = d\i(y) . 

^ T2 ^ t \f (sin 2 ttx + sin 2 ny) (l + sin 2 ttx + sin 2 7ry) 

This integral has no simpler form, so to compute p : = 1 — HM(l — h), we calculated 
h numerically and used the result to calculate the harmonic mean numerically. Since 
J 1/h = 00, as is easily checked, this process is not uniformly insertion tolerant. Similarly, 
one can check that the process of edges lying on the x-axis of the uniform spanning tree 
in d > 4 dimensions is not uniformly insertion tolerant. 

Example 5.21. Let / be as in Example [D§ It turns out that GM(/) = e ~ 2G ^ /y/2 = 
1 — GM(1 — /) = 0.39467" 1 ". This gives strong domination inequalities. It is easy to see 
that HM(/) = HM(1 — /) = 0, so there are no nontrivial full domination inequalities. The 
interest of this function is that it describes the process of edges along a zig-zag path in the 
plane for the uniform spanning tree, as claimed in Example |1.4| . We now sketch how to 
prove this. The Transfer Current Theorem of Burton and Pemantle (|1993| ) allows one to 
calculate, via determinants, the law for any set of possible edges of the uniform spanning 
tree. Thus, it is enough to verify that for the edges belonging to the zig-zag path, the 
matrix entries are those of the Toeplitz matrix associated to the Fourier coefficients given 
in Example |1.4| . For even k, the values of f(k) are given in Lyons with Peres ( |2003| ) . (These 
values imply the astonishing fact that the edges in the plane that lie along a diagonal, e.g., 
the horizontal edges with left endpoints (n,n) (n G Z), are independent, i.e., have law 
Thus, it remains to treat the case of odd k. A straightforward application of the 
Transfer Current Theorem gives that the matrix entry corresponding to the edges eo and 
e 2 fc-i is 

e(s) + e(t) - e(s + 1) - 1 „ , . . N 

e(ks + kt)dX 2 (s 1 t) 



4 - e(s) - e(-s) - e(t) - e(-t) 

e(s) + e(x — s) — e(x) — 1 



, —e(kx)dX 2 (s 1 x) 1 

4 — e(s) — e(—s) — e(x — s) — e(s — x) 

where e(x) := e 27Flx . Evaluate the integral in s for fixed x by a contour integral 

1 / z + e(x)z~ 1 — e(x) — 1 dz 



2m J A — z — z 1 — e(x)z 1 — e(— x)z z 
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over the contour \z\ = 1. The integrand has poles inside the unit disc at z = and 

_ 2- y/4- |l + e(x)| 2 
l + e(-x) 

After use of the residue theorem and integrating in x, one obtains f(2k — 1), as desired. 

We close this section by describing how our domination results can be interpreted 
in terms of prediction and interpolation questions for wide-sense stationary processes. In 
view of Theorem |2.8| and the well-known correspondence between prediction and Szegd 
infima, for d = 1, P-^[?7(0) = 1 | r)\{— 1, —2, . . .} = 1] is exactly the mean squared error 
for the best linear predictor of Yq given (y n ) n <_i, where (Y n ) is a wide-sense stationary 
process with spectral density /. Similarly, P^[r/(0) = | rj\{— 1, —2, . . .} = 0] is exactly the 
mean squared error for the best linear predictor of Yq given (Y" n ) n <_i, where (Y n ) is now a 
wide-sense stationary process with spectral density 1 — /. This gives us a correspondence 
between strong domination and prediction. An analogous correspondence holds between 
full domination and interpolation, where one instead looks at the mean squared error for 
the best linear predictor of Yq given (Y n ) n ^ . 



§6. Entropy. 

We assume the reader is familiar with the definition of the entropy H (/j.) of a process 
/i, as well as basic results concerning entropy (see Walters ( |1982| ) and Katznelson and 
Weiss ( p.972p ). Because of Ornstein's theorem (and its generalizations, see Katznelson 
and Weiss ( |1972D , Conze ( |1972/73| ), Thouvenot (|T97^ ), and Ornstein and Weiss ( |T987D ) 
that entropy characterizes Bernoulli shifts up to isomorphism, the following question is 
particularly interesting: 

Question 6.1. What is H(P f )l 

We know the answer only in the trivial case where / is a constant and in case / or 1 — / 
is the reciprocal of a trigonometric polynomial of degree 1. In principle, as we shall see, 
one can also determine the entropy when / or 1 — / is the reciprocal of any trigonometric 
polynomial, but the formula would be rather unwieldy. 

In general, then, we shall discuss how to estimate the entropy of P-^. The definition 
of entropy always provides upper bounds, due to subadditivity, so the harder bound is the 
lower bound. As we shall see, reciprocals of trigonometric polynomials can be used to get 
arbitrarily close lower (and upper) bounds. Unfortunately, that method is not practical for 
precise computation. Nevertheless, in many cases, we have another method that appears 



6. Entropy 



29 



to work quite well. Indeed, our method seems to provide upper bounds that converge more 
quickly than does use of the definition. 

Shirai and Takahashi ( p003|) proved that H(Pf) > for all / ^ 0, 1. Of course, this 



also follows immediately from our Theorem |3.1| . 
Let 

H\p] := H(fjt p ) = -plogp - (1 -p) log(l -p) . 

Theorem |5.11| yields easy lower bounds on the entropy of those processes P-^ such that 
f(l — f) has a strictly positive geometric mean. We shall obtain more refined lower bounds 
later in the one-dimensional case. It is easy to see that 

H v 4s H Hi-p H(p) > H[p] . (6.1) 



By Theorem |5.11| and |(6.1)| , we deduce the following lower bound on entropy: 
Proposition 6.2. For any measurable f : T d — > [0, 1], we have 

H{P f ) > min{fr[GM(/)],fr[GM(l - /)] 



Remark 6.3. This can be compared to the trivial upper bound H(pf) < H\f(0)]. 
Note that /(0) is the arithmetic mean of /. Shirai and Takahashi (|2003| ) also note 
this upper bound and provide two lower bounds: H(pf) > H[f '(0)]/2 and H(pf) > 
J Jd H[f(x)] dX d (x). 

Remark 6.4. It is interesting that |(6.1)| is not true if ^ s is replaced by For example, 



let (X2i,X2i+i) be, independently for different i, (1, 1) or (0,0), each with probability 1/2. 
If v is the distribution of this process, then v is not stationary, but \i := {y + T(v)) /2 is 
stationary, where T is the shift. Next, H(/i) = log 2/2 < H[l/y/5\, even though At 1 _ 1 ^y2 ^ 
\i ^ l^x /s/2i as i s easn y verified. We also observe that this measure does not even have full 
support. 

Example 6.5. Let / be as in Example |1.2j , so that P-^ is the law of the horizontal edges 
of a uniform spanning tree in the square lattice Z 2 . It is known that the entropy of 
the entire uniform spanning tree measure (both horizontal and vertical edges included) is 
5.3)| ; see Burton and Pemantle (|1993j ). This can be compared to the bound on H(pf) 



that Proposition |6.2| provides, together with the calculations of Example |5.13| , namely. 



H(Pf) > H[e~ AG ^} = 0.6203+ (The results are much worse if one uses the bounds 
of Shirai and Takahashi (|2003|) reported in Remark |6.3| above.) Direct calculation using 
cylinder events corresponding to a 4-by-4 block gives an upper bound for the entropy 
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H(P f ) < 0.68864. Of course, the vertical edges of the uniform spanning tree measure have 
the same entropy as the horizontal edges. 



Example 6.6. Let g(x) be as in Example |L^, so that the edges of the uniform spanning tree 
in the plane that lie on the x-axis have the law P 9 . In view of Example |5.5| , Proposition |6.2| 
implies a lower bound of H(P 9 ) > H[y2 — 1] > 0.67835. Also, direct calculation using 
cylinder events of length 16 gives an upper bound H(P g ) < 0.69034. In Example |6.15| , we 
shall obtain more refined bounds on the entropy. It is not hard to see that the horizontal 
edges associated to the y-axis have law P 1-9 . 

Although not relevant for our determinantal probability measures (where we have 
seen that =^ s and =^ are equivalent), it is interesting to ask whether \x p ^ \x ^ {i\- p with 
p > yields any lower bound on the entropy H (fx) for general \i. We first ask the question 
whether \i p =<I \i with p > implies that H([i) > provided fx ^ Si. The answer to this 
question is affirmative since it is known (see Furstenberg (|1967| ) for d = 1 and Glasner, 
Thouvenot, and Weiss (|2000j ) for d > 1) that zero-entropy processes are disjoint from i.i.d. 



processes (meaning that there are no stationary couplings of them other than independent 
couplings). The next result provides an explicit lower bound on the entropy for processes 
trapped between two i.i.d. processes. The proof was obtained jointly with Chris Hoffman. 

Proposition 6.7. For any d > 1, if [i p ^ /i ^ [i\- v with p > 0, then 

H(fx) > max {a p , b p } > , 



where 



and 



a 



- =(1 - p)l0g (l 1 ^)- L 2^ l0g Vl-2p 



1 



b p := 2(1 - p) log (j^-) - (1 - 2p) log (y^J-) ~ (1 - 2p) log 2. 



pJ V 1 — 2p / 

Remark 6.8. Observe that b p approaches log 2 as p — > 1/2. 

Proof. Let X, Y and Z denote processes with respective distributions fi p , \x and \x\- p . We 
construct a joining (stationary coupling) of all three processes as follows. Consider any 
joining m\ of X and Y with Xi < Y\ a.s. and any joining of Y and Z with Y\ < Zi a.s. 
We now pick a realization for Y according to \x and then choose X and Z (conditionally) 
independently using m\ and respectively (this is called the fibered product of m\ and 
mi over Y). This gives us a joining (X', Y', Z') of X, Y and Z. We may now assume that 
{X,Y,Z) = {X',Y',Z f ). 
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We now use standard facts about entropy. First, H(X, Z) > H(X, Y, Z) — H(Y). We 
next note, using the conditional independence of X and Z given Y, that 

H(X, Y, Z) = H(Y) + H(X, Z \Y)= H(Y) + H(X \ Y) + H(Z \ Y) 
> H(X) + H(Z) — H(Y) . 

Hence 

H{X, Z) > H(X) + H(Z) - 2H(Y) = 2H[p] - 2H(Y) . 

Since X < Z, the one-dimensional marginal of (X, Z) necessarily has 3 atoms of weights 
p, 1 — 2p and p. It follows that 

H(X, Z) < 2plog (i) + (1 - 2p) log (jZt) ■ 

Combining this with the previous inequality shows that H(fj.) > a p . 

We modify the above proof to show that b p is also a lower bound. Rather than using 
H(X, Z) > H(X, Y, Z) - H(Y), we use H(X, Z) = H(X, Y, Z) - H(Y \X,Z). The earlier 
argument then gives 

H{X, Z) > 2H[p] - H(Y) - H(Y \X,Z). 

Using the same upper bound on H(X, Z) as before and the trivial upper bound for H(Y \ 
X,Z) of (1 — 2p)log2 (obtained by noting that Yi is determined by Xi and Zi when 
Xi = Zi) yields the lower bound of b p . 

The easiest way to check the strict positivity of a p is to observe that 

2H[p] > 2plog (-) + (1 - 2p) log (y^) • 

This, in turn, follows from the fact that the left-hand side is the entropy of the independent 
joining \x p x \i\- pi while the right-hand side is the entropy of the monotonic coordinatewise- 
independent joining of \i p and /ii_ p , whose existence follows from the inequality p < 1 — p. 

I 

There are few cases where we know the entropy H(P*) exactly. Besides the case when 
/ is constant, we can calculate the entropy when / is the reciprocal of a trigonometric 
polynomial of degree 1. We shall illustrate this in the simplest case, Example |1.7| . Thus, 
let < o < 1, d = 1, and f(x) := (1 — a) 2 /|e 27rix — a| 2 . Recall that that for any invariant 
measure \i in one dimension, we have 

H{y)= I H [vVlo = 1 I 77-1,77-2, •••]] d/x(7/_i,»7_2, •••)• (6-2) 
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Now by |(1.3)| , it is easy to check that 

P f [Vo = l\V-uV-2,...] = N _ (N _ 1)a , 

where N := min{/c > 1 ; r/_k = !}■ In addition, we may easily show that for n > 1, 

(1 — a)(n — (n — l)a)a n_1 



1 + a 
Therefore 



rr/pA _ y ( 1 ~ a )( n - ( n ~ l)a)a n V (l-a) 2 n (1 - a) 2 n 



1 + a V ?i — (n — l)a n — (n — l)a 

n=l 

n — (n — l)a — (1 — a) 2 n ^ n — (n — l)a — (1 — a) 2 n 



n — (n — l)a ' n — (n — l)a 

1^ E fln_1 ( - C 1 - a ) 2n MCI - a) 2 ™] 

n=l 

— [a(l — a)n + a] log[a(l — a)n + a] 
+ [(1 — a)n + a] log[(l — a)n + a] J . 

In principle, one can also write an infinite series of positive terms for the entropy of 
P-^ when / is equal to the reciprocal of any trigonometric polynomial, since, by Propo- 
sition |2.1U| , the process P-^ is then a regenerative process. Of course, the answer will be 



much more unwieldy than the above formula. However, it can be used to get arbitrarily 
good lower bounds on the entropy of any process P-f, in theory. To see this, we use the 
following lemma, which is more or less Lemma 5.10 in Rudolph (|1990| ). As our proof is 
somewhat shorter and gives a more precise bound, we include it. See also Burton and 
Pemantle (|1993| ), Lemma 6.2, for a similar proof. 



Lemma 6.9. For any stationary processes \i and v, we have \H(fi>) — H(v)\ < H[d(n, v)]. 

Proof. Consider a stationary process (X, Y) = ((Xi,Yi)) i€ %d, where X = (Xi) ieZ d has 
distribution fi, Y = (Yi) ieZ d has distribution u, and P(Xq ^ Yq) = d(n,v). Let Zi := 
X t + Y t mod 2. Then 

H((m) < H{X, Z) = H(Y, Z) < H(v) + H(Z) < H(u) + #(d(/z, v)\ . 



Since the same holds with \x and v switched, the result follows. 



I 
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Thus, given any measurable / : T — > [0, 1] and any e > 0, define 5 to be the smallest 
positive number so that H[8] = e. Then choose a trigonometric polynomial T > 1 such that 
J T |T _1 — max(/, 5/2)| d\\ < 8/2. We can calculate #(P 1 / T ) as closely as desired. Since 
dCP 1 ^, P / ) < 5 by Proposition gg, it follows by Lemma |Q| that |#(P J ) -H(P 1 / T )\ < e. 

Unfortunately, this method of calculation is hopeless in practice since when T has 
a high degree, it will take a very long time to see a renewal, which means that cylinder 
events of great length will be needed for the estimation. When a cylinder event of length 
n is used, one must calculate 2 n probabilities. Thus, this is completely impractical. 

However, we now exhibit an alternative method that works extremely well in practice, 
although we cannot prove that it works well a priori. We begin with two simple examples, 

f{x) := l[ ,i/2](aO (6.3) 

(which also occurred in Example |1.6| ) and 

g(x) := sin 2 nx = (1 — cos27rx)/2 . (6.4) 

Since f(k) = g(k) = for all even k ^ 0, both P-^ and P 9 have the property that looking 
at only the even coordinates, we see independent fair coin flips, i.e., Hi/2- Therefore the 
entropy of both processes is at least (1/2) log 2. In either case, we know of no direct method 
to prove that strict inequality holds, but it does. We first show this for P 9 . In Example p.5| 
we showed that GM(g) = 1/4. By symmetry, GM(1 - g) = 1/4. Since #[1/4] = 0.56+ > 
(1/2) log 2 = 0.35", we obtain by Proposition that if (P*) > (1/2) log 2. 

In order to show that H(pf) > (1/2) log 2 for / as in |(6.3)| , we need a method to 
obtain more refined entropy bounds. We illustrate such a method beginning with this 
simple function g. While we shall explain afterwards a method to obtain results for more 
general functions for the case d = 1, this first proof contains the essential idea of this 
method while at the same time relying on a more elementary calculation, and therefore, 
we feel, is worth including. 

Proposition 6.10. With g as in \(6.4A we have 



0.63" = -#[1/4] + -#[11/28] < H(P 9 ) < -#[7/20] + -#[5/12] = 0.67" . 
8 8 8 8 

Lemma 6.11. Let h : T — > [0, 1] be a trigonometric polynomial of degree at most 1, i.e., 
h(k) = for\k\ > 2. Fix n > and A C {1,2, ... , n}. For C C {1,2, ... , n}, let X(C) 
denote the sequence of lengths of consecutive intervals in {1, 2, . . . , n} \ C. Then 

P h [ V \A = 0, r,\({l,2,...,n}\A) = l]= J] D^h) . 

CCA ie\(C) 
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Proof. Whenever two sets F 2 C Z neither overlap nor come within distance 1 of each 
other, the configurations on Fx and F2 are P^-independent because h(k) = for > 2. 
Therefore P^t^l, 2, . . . , n} \ C) = 1] = U te x(C) D i-i( h )- Now the desired formula 
follows from the inclusion-exclusion principle. I 



Proof of Proposition \6.1(\ . Write d n := D n -i(g). Recall that 



H(P 9 ) = J H[P°[ Vo = 1 I V-i,V-2, ■ ■ ■}} dP^rj- 1: rj-2, • • •) • (6-5) 

By the negative association property of P 9 [ ■ \ rj-i = r\-i = 1], we have that on the 
event {r\-\ = r\- 2 = 1}, 

P 9 [r]o = 1 I 77-1, rj-2, ■■■}> lim P 9 [r/o = 1 | 77—1 = r}- 2 = ■ ■ • = r}- n = 1] 

n — >oo 

= lim P3 ^° = V-i = V-2 = ■ ■ ■ = V-n = 1] 



*oo P9 [ 7 y_ 1 = ry_ 2 = ... = rj_ n = 1] 
= lim d n+1 /d n = GM(g) = 1/4. 

n — »oo 

Likewise, on the same event, we have 

P 9 {r) = 1 I 77-1, ?7_2, • • •] < lim P 9 [r) = 1 \ 7]^ = rj_ 2 = 1, rj_ s = ■ ■ ■ = n_ n = 0] 

n — >oo 

= lim P 9 [rj = I = ?7_2 = 0, ?7_3 = • ■ ■ = rj- n = 1] 

n — >oo 

[by symmetry] 

= 1 - lim P 9 [r]o = 1 I r/-i = r]-2 = 0, 77-3 = • • • = V-n = 1] 

n — >oo 

P 5 [r7_i = 77_ 2 = 0, 770 = 77-3 = • • • = 77_ n = 1] 

= 1 — lim : — 

n^oo P9[r]- 1 = 7]-2 = 0, ?7-3 = • ■ • = ?7-n = 1J 

,. d\d n -2 — d2d n -2 — d\d n -\ + d n +± 
= 1 — lim 

n^oo d n -2 - d\d n -2 - 4-1 + d n 

[by Lemma |6.11| 
d 1 -d 2 -d 1 GM(g) + GM(g) 3 
1 — di — GM(g) + GM(5f) 2 

= 7/20 

since d\ = 1/2 and d 2 = 3/16. We may conclude that on the event {77-1 = 77_ 2 = 1}, we 
have 

H [P% = 1 I 77_ l5 77_ 2 , . . .]] G [#[1/4], #[7/20]] . 
By symmetry, the same holds on the event {77-1 = 77-2 = 0}. 
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Similarly, we have that on the event {77-1 = 1, 77—2 = 0}, 

P 9 [r) = 1 I t/_i, 77-2, • • •] > lim P 9 [rj = 1 | rj_ 2 = 0, rj-i = r/_ 3 = • • • = rj-n = 1] 

n — >oo 

= lim P9 ^- 2 = Q , Vo = V-i = V-3 = • ■ ■ = V-n = 1] 



n- 



00 P%_ 2 = 0, 77-1 = ?7_3 = • • • = r\- n = 1] 

— d n+ i 



d\d n -2 — d n 

_ d 2 -GM(g) 3 
di-GM(^)2 
= 11/28. 

Likewise, on the same event, we have 

P 9 [r]o = 1 I 77_i,r/_ 2 , . . •] < lim P 9 [?7o = 1 | 77-1 = 1, 77-2 = 77-3 = ■ ■ ■ = ?7_ n = 0] 

n — >oo 

= lim P 9 [t7o = I 77-1 = 0, 77-2 = 77-3 = • • • = 77-™ = 1] 

n — >oo 

= 1 - lim P s [?7o = 1 I 77-1 = 0, 77-2 = 77-3 = • • • = 77-™ = 

n — >oo 

P 3 [t7-i = 0, 770 = 77-2 = • • • = 77-™ = 1] 



= 1 — lim 

n — >cx 

= 1 — lim 
= 1 



P%_1=0, ?7_2 = ••• = 77_ n = 1] 

d\d n -i — d n+ i 
00 d n -i — d n 
dt-GMig) 2 



l-GM(g) 
= 5/12. 

We may conclude that on the event {t7_i = 1, 77_ 2 = 0}, we have 

#[P% = 1|t7-i,77_ 2 ,...]] e [#[11/28], i?[5/12]]. 

By symmetry, the same holds on the event {77-1 = 0, ry_2 = 1}. 

Putting all these bounds into |(6.5)| gives the claimed bounds on the entropy of P 3 . I 



For more general functions, we can get lower bounds on the entropy by using the 
asymptotics of Bump and Diaconis (|2002|) or of Tracy and Widom (|2002j) , which serve as 
replacements for our use of the special Lemma |6.11| . In fact, we use an extension of the 
formula of Tracy and Widom ( |2Q02[) due to Lyons ( |2002b| ). Denote by Uf the measure on 

2 N obtained by the limiting condition lim,^^ P-^[ - | 77 1 = 77—2 = ■ ■ ■ = 77.^ = 1], which 

exists by Proposition |2~^ . If GM(/) > 0, define 

Q f (z) :=exp| f + * log/ft) dA x (t) (6.6) 
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for \z\ < 1. The outer function 



(p f (t) := lim$ f (re 2nit ) (6.7) 

exists for Ai-a.e. t G T and satisfies \<Pf\ 2 = f Ai-a.e. This limit also holds in L 1 (T). See 
Rudin (P-987|) , Theorem 17.11, p. 340, and Theorem 17.16, p. 343. As is easily verified, 



log$/(«) = F(0) +2^T J F(k)z k (6.8) 

k=i 

when F := (1/2) log/. The mean value theorem for analytic functions and the L 1 (T) 
convergence above show that 

0/(O) = * / (O) = > /GM(/). 

Define $/ := <f>f := if GM(/) = 0. An analytic trigonometric polynomial is (a constant 
times) an outer function iff its extension to the unit disc has no zeroes in the open disc. [On 
the one hand, the extension of no outer function has any zeroes in the open disc since it is 
an exponential. On the other hand, by factoring a polynomial, one has to check merely that 
z i — > z — C is a constant times an outer function for |£| > 1. Indeed, &f(z) = (— \(\/()(z — C) 
for f(t) := |e 27 ™* — £| 2 , as can be seen by the Poisson integral formula for |£| > 1 (Rudin 
( |1987| ), Theorem 11.9, p. 235) and by a limiting procedure for |£| = 1.] The results of 
Lyons (|2002b|) show that Theorem [T9| reduces to the following formula: 

Theorem 6.12. For any measurable f : T — > [0,1], the measure Vf is equal to the de- 
terminantal probability measure corresponding to the positive contraction on £ 2 (N) whose 
(j, k) -matrix entry is 

^2fifti -l)<p}(k-l). 

1=0 

This can be substituted in the appropriate places in the proof of Proposition |6.10 . 
For example, on the event {r]-i = 17—2 = 0}, one has that 

P f [rjo = 1 I ry_i, rj- 2 , ■ ■ ■} > hm P / [rj = 1 | rj-i = n- 2 = 0, 77-3 = 77-4 = • • • = rj- n = 1] 

n — >oo 

= Vflm = 1, Vi = Vo = 0]/i//[?7i = 770 = 0] . 

One can use v\-f in a similar fashion for estimating such probabilities above by 77 that 
terminate in repeating 0s, rather than in repeating Is. For example, if 



/ := |l + ei +e 2 | 2 /9 
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then iff = (1 + ei + e2)/3 since the polynomial (1 + z + z 2 )/3 has no zeroes in the open 
unit disc. Similarly, one can show that 

(Pi—f = ei H e? . 

n / 6 3 6 



With these and Theorem 16.12 , one can show by following the method of proof of Proposi- 
tion |6.10| that 

0.53 < H(P f ) < 0.61. 

This method of estimation is relatively easy to program on a computer; decomposing by 
cylinder events of length 8 instead of the length 2 used in the proof of Proposition |6.10| 
and using Mathematica, we find that 

0.601992 < H(P f ) < 0.602433 . 

Similarly, using cylinder events of length 15, we obtain that 

0.65907716 < H(P 9 ) < 0.65907733 (6.9) 



for the function g in (6. 1)[ 



We now return to the problem of showing that H(pf) > (1/2) log 2 for / as in (6.3) 
Proposition 6.13. Iff := l[ ,i/2], then H(P*) > (l/2)log2. 

Proof. Let / := .991 [0 ,i/ 2 ] + .011(i/ 2) i). proposition tells us that d(P / ,P^) < .01, and 
hence from Lemma ^J, H(Pf) > H(Pf) - #[.01]. Now #[.01] < 0.0561. Write 5 := .01 
and L :— log(5 _1 — 1) = log 99. A simple integration shows that 

r (1/2) if* = o, 

(where % := v^— T). To find (py, we proceed as follows. Let F : = (1/2) log/. Then 

r (1/4) log 8(1 -8) iffc = 0, 
^ 4A;7r 



By using (6.8) and exponentiating, we obtain 
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Since the coefficients of this power series are the Fourier coefficients of ipj, we have a 



procedure to compute the Fourier coefficients if~. Since f(l 



x 



1 — /(x), we have 



^ x _j{z). Thus, the kth Fourier coefficient of ( P 1 _j equais the compiex conjugate 
of the kth Fourier coefficient of fj. Using the method of Proposition |6.10| with cylinder 
events of length 3, which requires knowledge of Fourier coefficients only for < 2, we ob- 
tain that if(P') > 0.4105. Therefore H(pf) > 0.4105 - 0.0561 = 0.3544 > (1/2) log 2. (If 
we use cylinder events of length 12, then we obtain the bound H(pf) > 0.4442. However, 
we believe that the true entropy is significantly larger still.) I 

Example 6.14. We can also use these bounds to prove that H(P->) does not depend only 
on the distribution of /. For example, consider the function 



f(t) :=sin 2 (7rf/2) 



on [0, 1], which has the same distribution as the function g of |(6.4)| . An elementary inte- 
gration shows that 

1/2 if/c = 0, 



/(*) 



2ki 



(2k- l)(2k + 1)tt 



if k ^ 



(where i := y— l). To find <pf, we proceed as follows. Write F := (1/2) log/. The real 
parts of the integrals giving F(k) can be found in standard tables, while the imaginary 
parts are derived in Lyons, Paule, and Riese (|2002| ), giving 



F(k) 



-log 2 

1 i 
-4k + k^ 



E 



if k = 0, 
if k > 0. 



As in the proof of Proposition 6.13, we compute 



= 1/2 + (-1/4 + i/n)z + (-1/16 + i/6n - l/n 2 )z 2 
- (1/32 - 19i/3607r + 5/6tt 2 + 2i/37v s )z 3 
+ (-5/256 + 89z/5040tt - 27/40tt 2 - i/n 3 + 1/3tt 4 )z 4 + 



whose coefficients are the Fourier coefficients of iff. Since 1 — sin 2 (7rt/2) = sin 2 (7r(l — 1)/2), 



it follows from the definition |( 6. 6 )| that <&f{z) = $>i-f(z). Since the power series coefficients 
of are the Fourier coefficients of <fi-f, we obtain that the kth Fourier coefficient of 

ifx-f equals the complex conjugate of the /cth Fourier coefficient of iff. Using the method 
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of Proposition |6.10 with cylinder events of length 8, which requires knowledge of Fourier 



coefficients only for \k\ < 7, we obtain that 

0.659648 < H(P f ) < 0.684021 . 



Comparing with |(6.9)| , we see that H(pf) > H(P 9 ). 



Note that even when neither / nor (pj can be found explicitly, they can always be 
found by numerical integration and exponentiation, and then one can follow the procedure 
we have used in this last example. Indeed, that will be done for part of our final example. 

Example 6.15. As in Example |6J| let 

sin ttx 

g(x) : 



+ sin 2 ttx 

Then the edges of the uniform spanning tree measure in the plane that lie on the x-axis 
have the law P 9 . Write gi(x) := sin 2 tvx. The calculations in Example |1.3| show that 
$ gi (z) = (1 - z) /2 and (z) = [V2 + 1 - (V2 - l)z}/2, whence 



$ 91 _ / 1-z 



V v^ + i_(V2-i; 

Expansion of <& g (z) in a Maclaurin series gives 

1 z (-l + 2V2)z 2 (5-2V2)z 3 (-27 + 28V2)z 4 

(1 + V2) 1 / 2 ~~ (1 + V2f/ 2 ~~ 2 (1 + V2) 5 / 2 ~ 2 (1 + V2) 7 / 2 8(l + v / 2) 9/2 



+ 



which tells us (p g . The transfer currents, which can be calculated by the method in Burton 
and Pemantle ( [1993D , tell us g; for example, g(k) for k = 0, 1, 2, 3, 4 is 



1 1 2 5 8 25 118 129 608 



2 2 tv 2 TV 2 3tv 2 3tv 

We use numerical integration to find Then if we use cylinder events of length 8, we 

find that 

0.69005 < H(P 9 ) < 0.69013. 
It is interesting how close this is to log 2 = 0.69315 - . 

We close our treatment of entropy with some elementary observations. If f n — > 1/2 
weak* (meaning that we have weak* convergence of the measures having these densities), 
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must H(P-> n ) — ► log 2? The answer is "no", as we now demonstrate. Given any / : T — > 
[0, 1] and any integer n, let M Xn f denote the function 

M xn f(x) :=f(nx). 

Let (rj ( k n) ; fc <E Z) have the distribution P M *»/. Since /(x) = E fc6 z /We 2 ™^ in l2 ( t ), 
the Fourier expansion of M xn / is M xn f(x) = J2kezf( k ) e2niknx for ^ 0. In particular, 
for any n ^ and any r, the processes (77^ r ; G Z) each have distribution and are 
independent of each other as r ranges from to n - 1. Therefore #(P M x™/) = if(p/). 
Note that unless / is constant, H(P M * nf ) = H(P f ) < H[f(Q)], even though M xn / tends 
weak* to the constant function /(0) ■ 1. One can show that a similar phenomenon holds 
in higher dimensions, that is, if / : T d — > [0, 1] and A : T d — ► T d is a group epimorphism, 
then if(P / ) = H(pf° A ). 



§7. Phase Multiplicity. 

In this section, we classify exactly the set of functions / for which P-^ satisfies a strong 
full K property. This property, which we now describe, is an essential strengthening of the 
usual Kolmogorov or K property. One of the reasons this property is interesting is that it 
is closely connected to the notion of multiplicity for Gibbs states in statistical mechanics; 
in particular, it corresponds to uniqueness. In the next section, we classify exactly the set 
of functions / for which P-^ satisfies a (1-sided) strong K property. 

In this section and the next, we always assume that / is not identically 
nor identically 1. 

To begin, we first recall the K property for stationary processes indexed by Z. There 
are many equivalent formulations, of which we choose an appropriate one. 

For S C Z d , write J-'(S) for the a-field on 2 Z generated by 77(e) for e G S. 

Definition 7.1. A translation-invariant probability measure \x on 2 Z is K (or a Kol- 
mogorov automorphism) if for any finite cylinder event E and for all e > there exists 
an iV such that 



\n(E I ^((-00, -N] n Z)) - n(E) I > e 



< e. 



It is well known that the above definition of K is equivalent to having a trivial (1-sided) 
tail cr-algebra in the sense that the a-algebra Dm>i -^{(~ °°> ~ m }) is trivial (see page 120 
of Georgii (|1988| ) for a version of this). The K property is known to be an isomorphism 
invariant and is also known to be equivalent to the property that all nontrivial factors have 
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strictly positive entropy. The latter implies that a process is K iff its time reversal is K, 
something which is not immediate from the definition. 

In order to give a complete discussion of the points that we wish to make, we need to 
introduce three further properties, which are respectively called the full K property, the 
strong full K property, and the (1-sided) strong K property, the last property given only 
for d = 1. 

While the notion of the K property generalizes to Z d (see Conze ( |1972/73"|) ), there is 
a slight strengthening of the definition that has a more aesthetic extension to Z d . To give 
this, define B d := [-n, n] d n Z d . 

Definition 7.2. A translation-invariant probability measure \x on 2 Z is full K if for any 
finite cylinder event E and for all e > there exists an N such that 



\li(E\H(B d N ) c ))-^E)\>e 



< e. 



Analogously to an earlier statement, the full K property is equivalent (see again page 
120 of Georgii ( |1988| )) to having a trivial full tail, which means that f| m>1 JF((S^) C ) is 
trivial. It is well known that K does not imply full K even for d = 1 (see, for example, 
the bilaterally deterministic Bernoulli shift processes constructed in Ornstein and Weiss 
(P-975D ). However, for Markov random fields, the two notions are equivalent (see den 
Hollander and Steif ( |1997| , gDDjj) ). Lyons ( |2002a|) proved that all (not necessarily Z d - 
invariant) determinantal probability measures satisfy the full K property. For Gibbs states 
arising in statistical mechanics, this property is equivalent to an "extremality" property 
of the Gibbs state (see page 118 of Georgii (|1988| ) for precisely what this means and this 
equivalence) . 

The following definitions strengthen the K property in an even more essential way. 
We begin with the full version, which seems to us more natural. 

Definition 7.3. A translation-invariant probability measure \i on 2 zd is strong full K 
if for any finite cylinder event E and for all e > there exists an N such that 

\ti{E\F{{B d N ) c ))-ii(E)\<e /x-a.e. 

For the full K property, "/i-most" conditionings far away have little effect on a "local 
event", while in the strong full K case, all conditionings far away have little effect on a 
"local event" . This is a substantial difference. An example that illustrates this difference 
is the Ising model in Z 2 . The plus state for the Ising model at high temperatures is strong 
full K, while at low temperatures, it is full K (because it is extremal) but not strong full 
K. 



7. Phase Multiplicity 



42 



Finally, if d = 1, we define strong K in the following way (which the reader can 
presumably anticipate). One extension of this definition to Z d (among various possible) 
will be given in Section |8]. 

Definition 7.4. A translation-invariant probability measure \i on 2 Z is strong K if for 
any finite cylinder event E and for all e > there exists an iV such that 

\n(E | Fd-oo, -N] D Z)) < e //-a.e. 

We note that this definition is closely related to, but weaker than, the -0-mixing 
property (see Bradley ( |1986| )). The difference is that the event E here is specified in 
advance, rather than having a uniformity for all events E, provided only that they depend 
on the random variables with positive index. If we extend the notion of strong K in the 
obvious way to the case when {0, 1} is replaced by a countable set, then the example in 
Bradley (|1986| ) of a process that is ^-mixing, but whose time reversal is not ^-mixing, 
yields an example of a strong K process whose time reversal is not. This cannot occur for 
our measures P-^ since they are all time reversible. 

The following development has an analogy with the "plus and minus states" in the 
Ising model from statistical mechanics; however, such knowledge is not needed by the 
reader. 

Consider a function / : T d — > [0, 1] and consider the corresponding probability mea- 
sure P-^ on 2 Z . We shall define a probability measure (P^) + on 2 Z which will be "P-^ 
conditioned on all Is at oo" ; more specifically (but still not precisely) , we want to define 
(P/)+ by 

(P / )+ := lim P / [ • | n = 1 on (B d ) c ] . 
To make sure this is well defined, we proceed in stages. Let 

((P / )+) n := hm p/[ • | n= 1 on B d n+k \B d n ] . 

fc — >oo 

This limit is taken in the weak* -topology. Proposition |2.6| implies that, when restricted to 
B d , this sequence is stochastically decreasing and hence necessarily converges. ((P^) + ) n 
is therefore well defined. One next defines 

(P f )+:= hm((P')+) n , 

n — >oo 

where again the limit is taken in the weak* -topology. Proposition |2.6| again implies that for 
fixed k, ((P-^) + ) n restricted to B d , is, for n > k, stochastically increasing in n and hence 
converges. This implies that its limit (P^) + is well defined and completes the definition of 
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(P^) + . The stochastic monotonicity results also imply that (P^) + =^ P-^ and that (P^) + 
is translation invariant. 

In exactly the same way (using instead of 1 boundary conditions), one defines (P-^) - , 
which satisfies P-^ ^ (P-^) - . Analogy with the ferromagnetic Ising model in statistical 
mechanics leads us to the following definition. 

Definition 7.5. The probability measure P-^ has phase multiplicity if (P^)~ ^ (P^) + 5 
and otherwise has phase uniqueness. 

Lemma 7.6. P-^ has phase uniqueness if and only if it is strong full K . 



The reason this is true is that Proposition |2.6| implies that the most extreme boundary 
conditions are when all Is or all Os are used and the measures corresponding to any 
other boundary conditions are "stochastically trapped" between these two special cases. A 
detailed proof follows straightforwardly from the stochastic monotonicity arguments above 
and is left to the reader. 

We shall now obtain necessary and sufficient conditions for the equality of P-^ and 
(pf) + . From these, a necessary and sufficient condition for the strong full K property will 
easily emerge. 

Let T denote the set of trigonometric polynomials on T d . Let L 2 {1/ f) denote the set 

h : T" — C : / d\ d < oo 



Here we use the convention that 0/0 := 0. Note also that h needs to vanish where / does. 
Our main result on phase multiplicity is the following. 

Theorem 7.7. Assume that f : T d — > [0, 1] is measurable. The following are equivalent. 

(i) (p/)+ = Pf; 

(ii) / is in the closure in L 2 (l/f) of T D L 2 (l/ f) ; 

I T I ^ 1 rl 

(iii) There exists a nonzero trigonometric polynomial T such that J-J_ £ L (T ); i.e., 

rnL 2 (i//)^o. 

Moreover, if (pf)+ ^ p/, then (pf)+ = S . 

Proof. We shall first show that (i) and (ii) are equivalent and then that (ii) and (iii) are 
equivalent. 

(i) implies (ii): Let u n be the element in [B]f achieving the minimum in |(2.5)| for 
B := (B d ) c . Then 

||l||j = ||1 - u n \\ 2 f + \\u n \\ 2 . 
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By (i) and |(2.5)| , we have ||1 — u n \\f — > ||1||/ as n — > oo, or in other words, — > as 
n — > oo. Furthermore, 1 — u n _L [(-B^) c ]/ in L 2 (/), which is the same as (1 — u n )f being 
a trigonometric polynomial T n with T n supported in B d . We have 



1/ 



T rJ(l//) 



\f~Tr 



f 



fdX d 



\u. 



l fd\ d =\\u n \\ 2 f (7.1) 



tends to as n — > oo, which proves (ii). 

(ii) implies (i): Given e > 0, let T be a trigonometric polynomial with ||/ — 211(1//) < 
e. Let n be such that T is supported in B d . Define v := T/f, which is in L 2 (f). Then 
v ± [{B d n y\ f in L 2 (f) and ||1 - v\\ f = \\f - T|| (1//) < e, so UP^^l^ > ||1||J - e 2 . 
Combining this with |(2.6)| , we see that (P / ) + [?y(0) = 1] > P / [?7(0) = 1] - e 2 . Since this 
holds for any e > 0, we obtain that (Pf) + [r)(0) = 1] > pf[r)(0) = 1]. However, since 
(P / )+ ^ P-f and both are translation invariant, the two measures are actually equal by 
Lemma [3.3| , which proves (i). 

(ii) implies (iii): This is immediate. 

(iii) implies (ii): Assume there exists a nonzero trigonometric polynomial T G 
L 2 (l/f). Let e > 0. Choose B CJ d such that 



B 



T 



+ 2\T\ + f) < e 



and B is an open set containing the zero set of T (which is a set of measure 0, as remarked in 
the proof of Theorem ^) . Let g := TIb + Then g/T G L°°(T d ) since T is bounded 

away from on B c and < / < 1. We can now choose trigonometric polynomials p n on T d 
such that p n — > g/T a.e. on T d with ||p n ||oo < lb/Til oo for all n (just use p n '■= (g/T) * K d , 
where K d is, as before, the Fejer kernel for T ). Since \T\ 2 /f G L 1 (T d ), it follows from the 
Lebesgue dominated convergence theorem that 



lim 



\T\ 



9_ _ 

rp Pn 



0. 



n^oo Jj d f 

Hence there exists a trigonometric polynomial h such that 



f 



9 



-h 



< e. 



Minkowski's inequality (applied to L 2 (\T\ 2 / f)) yields 



(7.2) 



\Tf 

T d f 



T 



2\ 2 




T T 



2\ 2 



+ 



\T[ 

T 



T 



i 

2\ 2 
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The second summand is at most e 1 / 2 by (7.2) . The first summand is 



IT 



2 - *\ 2 



/ 



T 



< [ / /+2|ti+!3_v < e ^ 



by choice of £?. Hence J Td |/ — T/i| 2 4 <ix < 4e. Since T/i is a trigonometric polynomial, (ii) 
is proved. 

Finally, assume that (P / )+ ^ PA Since (iii) fails, [(B*) c ]f = for all n, which means 
by [23)1 that (P / ) + [?7(0) = 1] = 0, or, in other words, (P / )+ = 5 . I 

Example 7.8. If / : T — > [0, 1] is continuous and has a finite number of 0s with / ap- 
proaching each of these 0s at most polynomially quickly, then (P^) + = P-^ since it is 
easy to construct a trigonometric polynomial T with the same 0s as / and approaching 
at least as quickly as / (for example, T could be of the form Yli=i si* 1 ™ 27r(a; — a;)). In 
particular, if / is real analytic and not 0, then (P^) + = P^. 

Example 7.9. If / vanishes on a set of positive measure, then (P^) + = $o- 

Example 7.10. If / : T — > [0,1] is a continuous function with a single zero at xq G T 
and f(x) = e _1 /l x_a:o l in some neighborhood of xq, then (P^) + = So. Indeed, there is 
no nonzero trigonometric polynomial T with \T\ 2 /f G L 1 (T) since the rate at which a 
trigonometric polynomial approaches is at most polynomially quickly. 

Example 7.11. We give an example to show that the property (P^) + = P-^ does not de- 
pend only on the distribution of /, even among real-analytic functions /. In 2 dimensions, 
the function 

f{x,y) := sin 2 (27ry — cos27rx) 

generates a system for which (P^) + 7^ PT This is because / vanishes (even to second 
order) on a curve 

1 . 
y = — cos 2irx + Z (7-3) 

that is not in the zero set of any trigonometric polynomial in two variables except the 
zero polynomial. However, / has the same distribution as the trigonometric polynomial 
g(x,y) := sin 2 2ny and (P s ) + = P s . To see that the curve |(7.3)| does not lie in the 
zero set of any nonzero trigonometric polynomial, suppose that T(x, y) is a trigonometric 
polynomial that vanishes there. Write w := e 2lTlx and z := e 27Tiy . By multiplying by 
a suitable complex exponential, we may assume that P(w, z) = T(x, y) is an analytic 
polynomial. Our assumption is that 



h(w) := P(w, e 
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satisfies h(w) = for all \w\ = 1. Since h is an analytic function for w ^ 0, it follows 
that h(w) = for all w ^ 0. Now for each z ^ 0, there are an infinity of w such that 
e i(w+w )/2 _ z ^ Therefore for each z ^ 0, there are an infinity of w such that P(w, z) = 0. 
Since a polynomial has only a finite number of zeroes if it is not identically zero, this means 
that for each z ^ 0, P(w, z) = for all wGC. Hence P = 0, as desired. 

Example 7.12. For d = 2, if / is real analytic on a neighborhood of its zero set, then 
(Pf) + = P-^ iff its zero set is contained in a (nontrivial) algebraic variety, where we view 
T 2 as {(z±, Z2) G C 2 ; \z\\ = \z%\ = 1}. This is because the slowest / can vanish at a point 
is of order x 2 + y 2 (since the constant and linear terms of / must vanish) and l/(x 2 + y 2 ) is 
not integrable. Therefore, all the zeroes of / must be cancelled by those of T. Conversely, 
if the zero set of / is contained in the zero set of a trigonometric polynomial T, then 
by Lowasiewicz's inequality (see, e.g., Bierstone and Milman ( 1988Q , Theorem 6.4), for a 
sufficiently large n, we have \T\ n / f is bounded, whence integrable. 

Example 7.13. Here, we give a 1-dimensional example showing that the property (P^) + = 
P-f does not depend only on the distribution of /, even among continuous /. Let f(x) : = 
y/x\sm(ir/x)\ on [0,1]. Then (pf)+^pf since there are an infinite number of first-order 
zeroes. However, let g be the increasing rearrangement of / on [0, 1] and define 



h(x) :- 



g(2x) if x < 1/2, 
0(2 -2a;) if x > 1/2. 



Then h is continuous and has the same distribution as /, yet (P^) + = P h by Example 
together with an easy computation. Of course, there is no such example for real-analytic 
/ in one dimension by Example 



We finally state and prove our necessary and sufficient condition for the strong full K 
property. 

Corollary 7.14. Consider f : T d — > [0, 1] and the corresponding probability measure P-f . 
Then P-^ is strong full K if and only if there is a nonzero trigonometric polynomial T such 
that -J^y G L x (T d ). 

Proof. According to Theorem [7.7| , (P^) + = P-^ iff there exists a nonzero trigonometric 

\rp |2 , 

polynomial T x such that ^f- e L L (T d ). The same argument applied to 1 — / tells us 
that (P-^) - = P-^ iff there exists a nonzero trigonometric polynomial T2 such that G 



L 1 (T d ). Lemma ^]6| now completes the proof. (Take T := T\T%.) 



Remark 7.15. Observe that by trivial scaling, we could have an / whose Fourier coeffi- 
cients go to zero slowly but which is bounded away from and 1. Hence slow decay of 
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Fourier coefficients does not impiy phase multiplicity. On the other hand, if the coefficients 
decay exponentially, then Ylnei fi 12 ) 2 ™ * s complex-analytic in an annulus around the unit 
circle, which implies that its restriction to the unit circle is real-analytic and hence there 
is phase uniqueness. 

Remark 7.16. We mention that if (P^) + = P^, then this measure is F0mer independent 
in the sense of Adams ( |1992|) and if in addition (P^)~ = P-^, then this measure is even 
strong Folner independent in the sense that any conditioning outside a box yields a measure 
which is d close to the unconditioned process. The arguments for proving these facts are 
analogous to those of Ornstein and Weiss ( |1973| ) (see Adams ( |1992| ) for a published version), 
where it is proved that the plus state for the Ising model is a Bernoulli shift. (The concept 
of F0lner independence has been used also in den Hollander and Steif ( p.997| ) and Hoffman 

We close this section with an interpretation of our phase multiplicity results in terms 
of interpolation questions for wide-sense stationary processes. The following comments 



can be proved from Theorem |2.8| together with the well-known correspondence between 
interpolation and Szego infima. First, (P^) + = So is equivalent to being able to interpolate 
perfectly from information far away for a wide-sense stationary process (Y n ) with spectral 
density /, which means that for every n, Yq is in the closed linear subspace spanned by 
{^&}|fc|>n- For d = 1, it is stated on p. 102 of Rozanov ( |1967| ) that this latter condition 
is equivalent to the negation of condition (iii) in Theorem |7.7| . A similar statement holds 
for (P^)~ with / replaced by 1 — /. The proof of Theorem |7.7| implies (via this whole 
correspondence) that, for spectral measures that are absolutely continuous with a bounded 
nonnegative density, if perfect linear interpolation fails, then our ability to interpolate as n 
gets large goes to 0; i.e., the length of the projection of Yq onto the closed linear subspace 
spanned by {Yk}\k\> n g° es to as n — > oo. 



§8. 1-Sided Phase Multiplicity. 

We first study the notion of strong K for d = 1. This is natural since the struc- 
ture of "one-sided" behavior can be very different from "two-sided" behavior in various 
situations; for example, the existence of bilaterally deterministic Bernoulli shift processes 
demonstrates this. In contrast with Example |7.13| , we shall see that whether is strong 
K depends only on the distribution of /. 

Consider a function / : T — > [0, 1] and the corresponding probability measure P-^ on 
2 Z . We define a probability measure (P-^)" 1 "' 1 on 2 Z thought of as "P-^ conditioned on all 
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Is at — oo" (the superscript "1" refers to the fact that we are doing this on 1 side); namely, 
(p/)+> x : = lim lim [ • | 77 = 1 on [-n - k, -n] n Zl . 

n— »oo fc^oo 

The existence of the limit and its translation invariance follow from stochastic monotonic- 
ity, as did that of (P-^) + . As before, we also have (P-^)" 1 "' 1 P^. In the analogous 
way (using instead of 1 boundary conditions), one defines (P-^) - ' 1 , which then satisfies 
P^ ^ (P / )-' 1 . 

Definition 8.1. The probability measure P-^ has a 1-sided phase multiplicity if (P-^) - ' 1 7^ 
(P-^)" 1 "' 1 , and otherwise has 1-sided phase uniqueness. 



Note the following analogue (whose proof is left to the reader) of Lemma 7J6 for the 
1-sided case. 

Lemma 8.2. P-^ has 1-sided phase uniqueness if and only if it is strong K . 

Our main result for 1-sided phase multiplicity is the following. 

Theorem 8.3. If f : T — > [0, 1], then (P^)+> 1 = P^ iffGM(f) > 0. Moreover, z/(P^)+> 1 / 
P / , then (P / )+' 1 = 5 . 



Proof. By Theorem 5.12, we have 



n-l 



lim P f [r](0) = 1 I rj- n = r}- n -x = ■■■ = r\- n - k = 1] = \<^f{l)[ 



1=0 



(In fact, via Theorem |2.8| , this special case of Theorem |6.12| is due to Kolmogorov and 



Wiener; see Grenander and Szego ( |1984| ), Section 10.9.) Taking n — > 00, we find that 

(p / ) + ' 1 Wo) = i] = ||^||| = ||^/|| 2 2 . 

If GM(/) > 0, then we obtain 

(P'^faO) = 1] = H/llx = /(0) = P f [ V (0) = 1] . 

Since (P^) + ' 1 =4 P^ and both probability measures are Z-invariant, it follows by Lemma 
that (P- f )+' 1 = Pf. On the other hand, if GM(/) = 0, then 

(P^)+' 1 [r / (0) = l] =0, 

so (p/)+>! = S . 
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Our necessary and sufficient condition for the strong K property now follows imme- 
diately 

Corollary 8.4. For f : T — > [0,1], the corresponding probability measure P? is strong K 
if and only if GM(/)GM(1 - /) > 0. 

We now construct a process P-^ that is strong K but not strong full K. 

Theorem 8.5. There exists an f : T — > [0, 1] such that P? is strong K but not strong full 
K. 

Proof. Choose / such that / is continuous, bounded away from 1, vanishes at a single point 
Xq, and such that f(x) = e -1 ^ ^ x ~ x °^ 1 in some neighborhood of Xq. Since a trigonometric 
polynomial vanishes at its zeroes at most polynomially quickly, there cannot exist a nonzero 
trigonometric polynomial T such that \T\ 2 / f 6 L 1 (T). Hence by Corollary [7.14| , P-Ms not 
strong full K. On the other hand, it is clear that GM(/)GM(1 — /) > 0, so by Corollary |8.4| , 
we know that P* is strong K. I 



Remark 8.6. By combining Theorems |5.3| and |8.3| , we see that P / ^ P 9 implies (P- f )+> 1 ^ 



(P 9 ) + ' 1 . However, it is not necessarily the case that (P^) + ^ (P 9 ) + . For example, let 
(7 = 1/2 and / be a function as described in the last result, but which has a geometric 
mean larger than 1/2 (recall Theorem |5.3|) . This example also shows that if P s ^ P-^, 



there do not necessarily exist g' < f such that P 9 = P 9 and P-^ = P-^. To see this, let 
/ and g be as above. Then P 9 = P 9 easily implies that g' = 1/2, which in turn (using 
g' < f) implies that (P f ') + = P f ' . Since (P ; ) + ^ P / , this contradicts the fact that 
P/=P/'. 



It may be of interest to see another proof (which we only sketch) of Theorem |8.3| that 
does not depend on the asymptotics of Theorem |6.12| , but rather follows the lines of the 
proof of Theorem |7.7| . The appropriate replacement of the trigonometric polynomials for 
this question is the set 

A := {h E L 2 (T) ; there exists £ such that h(k) = for k < £} . 

Note that when I is taken to be 0, we get the Hardy space of analytic functions, denoted 
H 2 (T). 

Theorem 8.7. Assume that f : T — ► [0, 1] and let A be as defined as above. The following 
are equivalent. 

(i) (PZ)+' 1 =P/; 

(ii) / is in the closure in L 2 (l/f) of An L 2 (l/f); 
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(iii) There exists a nonzero element T G A such that ^ G L 1 (T); i.e., AnL 2 (l/f) ^ 0; 

(iv) GM(/) > 0. 

Moreover, if (P / )+' 1 / P^, t/ien (P / )+' 1 = 5 . 



Proof, (i) iff (ii) and (ii) implies (iii): These are proved exactly as in Theorem \7.7[ 

• • • * I 1 

(iii) implies (ii): Fix a nonzero T E A with J— ^ — G L (T). Given any e, there exists 
5 > such that if B is any subset of T with measure less than 6, then 



B 



IT 



2 



/ 



+ 2|T| +/ < e. 



Since T vanishes only on a set of measure (see Rudin ( |1987|) , Theorem 17.18, p. 345) 



there is a 7 > such that {x G T ; |T(x)| < 7} has measure less than 5. If we take B to 



be this latter set, then we can proceed exactly as in Theorem |7.7| since now T is bounded 
away from on B c . 

(iii) implies (iv): Any function in A is a complex exponential times a function in the 
Hardy space H 2 (T). Hence there is a nonzero function T G H 2 (T) with \T\ 2 / f G L 1 (T). 
Now GM(|T|) > (see Rudin ( p87l) , Theorem 17.17, p. 344). Letting g := |T| 2 //, we 
have that g G L 1 (T) and hence Gl\%) < J g < 00. Therefore GM(/) = GM(|T| 2 /#) = 
GM(|T|) 2 /GM(#) > 0, as desired. 

(iv) implies (iii): If GM(/) > 0, then take T := 99/, defined in Section |5|. For this 
T, we have |T| 2 // = 1, which is trivially integrable. 

The final statement is proved as in Theorem [7.7| . I 



We now extend Theorem |8.3| and Corollary |B.4| to higher dimensions. Fix a choice of 
ordering of Z d given by a set n and a sequence (fc n ) G Z d such that the distance from 
to — (n + k n ) goes to 00. 

Definition 8.8. A translation-invariant probability measure \i on 2 Z has phase unique- 
ness relative to the ordering induced by n and the sequence (k n ) as above if for 
any finite cylinder event E and for all e > there exists an iV such that for all n > N 

\{j,(E I JF(-(n + k n )) - /i(E) I < e /z-a.e. 



It is easy to check using Proposition |2.6| that this definition does not depend on the 
choice of the sequence (k n ). However, it turns out that the choice of ordering, n, of Z d 
can make a difference. In particular, if the ordering is archimedean (meaning that for any 
two positive elements a and b, there exists an integer n such that na > b), such as* when 



* In fact, all archimedean orders arise in this way; see Teh (1961), Zaiceva (1953), or Trevisan (1953). 
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the ordering is induced by IT := {k E Z d ; k - x > 0}, where x E M. d is a fixed vector having 
two coordinates with an irrational ratio, then we have a complete characterization of 1- 
sided phase multiplicity in terms of the geometric mean. For the standard lexicographic 
ordering, however, 1-sided phase multiplicity cannot be characterized by the geometric 
mean alone, as shown by the example / := l[o,i]x [0,1/2] m two dimensions. Here, both 
/ and 1 — / have geometric mean, but the columns of a configuration are independent 
under P-^, so that conditioning on the remote past has no effect on the present. On the 
other hand, positivity of the geometric means will still suffice for phase uniqueness relative 
to the ordering induced by IT, as we now prove. Analogously to the one-dimensional case, 
we let, for our given ordering IT, 

(p/)+,i : = lim P / [ • I 77 = 1 on - (n + fc n )l . 

n — »oo 

By Proposition |2.6| , this limit exists and is independent of (k n ). As before, (P-^) - ' 1 is 
defined analogously using boundary conditions of Os, one-sided phase multiplicity is defined 
by (P-^) - ' 1 7^ (P^) + ' 1 , and the property in Definition |8.8| is equivalent to one-sided phase 
uniqueness. 

Theorem 8.9. Let f : T d -> [0,1]. If GM(/) > 0, then (P^)+- 1 = P^. If GM(/) = 

and the ordering is archimedean, then (P-^)" 1 "' 1 = 5o. In particular, if the ordering 
is archimedean, then has phase uniqueness (relative to this ordering) if and only if 
GM(/)GM(l-/)>0. 

We begin with the background results on which we rely for the proof. For an ordering 
induced by IT, define the corresponding Helson-Lowdenslager space 

HL 2 := HL 2 (T d , II) := {/ E L 2 (T d ) ; supp / C II U {0}} . 

For < / E L 1 (A), let [IlU{0}] be the linear span of {e k ; k E nu{0}} and [nu{0}]/ be 
its closure in L 2 (f). The replacement for outer functions is the class of spectral factors, 
which are the functions 6 HL 2 with the additional properties 

£(0) > (8.1) 

and 

V^[nu{o}] M2 . (8.2) 

Helson and Lowdenslager ( |1958| ) show that for any II and for any < / E L 1 (T d ), the 
condition GM(/) > is equivalent to the existence of a spectral factor (pj such that 
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\ff\ 2 = f a.e. [More precisely, they prove GM(/) > iff 3f G HL 2 satisfying \(p\ 2 = f a.e. 
and (8.1) . Their proof shows that in this case, (p can be chosen so that also |(8.2) holds.] 



Furthermore, Lyons ( 2002b ) shows that iff is then unique. 



Denote by Vf the measure on 2 nu ^ ^ given by P-^[ ■ | r\ [(—11) = 1]. This is, as usual, 
defined by conditioning on more and more Is in —IT and using Proposition 2~T>. The results 
of Lyons ( f2002b ) imply the following. 

Theorem 8.10. Fix an ordering induced by a set IT. Let f : T d — > [0, 1] be measurable. If 
GM(/) > 0, define ipf to be its spectral factor (with respect to li). Otherwise, define tpf := 
0. If GM(/) > or the ordering of Z d given by H is archimedean, then the measure Vf 
is equal to the determinantal probability measure corresponding to the positive contraction 
on £ 2 (U U {0}) whose (j, k)-matrix entry is 

zenu{o}, i4j,k 

Proof of Theorem \8.f\ . Suppose first that GM(/) > or the ordering given by IT is 
archimedean. By Theorem |8.10| , we have 

P'fo(O) = 1 | n\-(U+k n ) = 1] = P f [rj(k n ) = 1 | n\(-U) = 1] = £ \$}{k n -l)? • 

zenu{o}, i4k n 

Taking n — > oo, we find that 

(vf)+> 1 [ v (o) = i] = \mg = y f \\l 

If GM(/) > 0, then we obtain 

(p^+'^o) = i] = Wfh = /(o) = p/[^(o) = i] . 

Since (P-^)" 1 "' 1 =<! P-^ and both probability measures are Z d -invariant, it follows that 
(P^)" 1 "' 1 = pf . However, if GM(/) = and the ordering is archimedean, then 

(P^)+' 1 [r ? (0) = l] =0, 

so (P-^)" 1 "' 1 = 5o. The last statement of the theorem follows as before. I 

Analogously to Section ^ our one-sided phase multiplicity results for d = 1 can be 
translated to known results for prediction (rather than interpolation) questions for wide- 
sense stationary processes. We leave these observations to the reader. 
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§9. Open Questions. 

Question 9.1. Calculate H(P f ). 

We conjecture that entropy is concave: 

Conjecture 9.2. For any f and g, we have H(P^+ 9 ^ 2 ) > (H(P f ) +H(P 9 ))/2. 

Question 9.3. Letting A vary over all sets of measure 1/2, how do we get the largest and 
the smallest entropies for P 1 - 4 ? More generally, given /, which g with the same distribution 
as / maximize or minimize H(P 9 )7 

Question 9.4. If d = 1 and GM(/)GM(1-/) > 0, do we get arbitrarily close lower bounds 
(in principle) on H(pf) by the method in Section [|? How can one get close lower bounds 
on the entropy for higher-dimensional processes? 

Question 9.5. Suppose / : T — > [0, 1] is a trigonometric polynomial / of degree m. Then 
P-f is m-dependent, as are all (m + l)-block factors of independent processes (as defined by 
Hedlund (|1969| )). Is it the case that P-^ is an (m-fl)-block factor of an i.i.d. process? Erik 
Broman (personal communication) has shown this when m = 1. If one can find such block 
factors sufficiently explicitly for trigonometric polynomials, then one could find explicit 
factors of i.i.d. processes that give any process P-' . This would enable one to use more 
standard probabilistic techniques to study PA More generally, in higher dimensions, if 
/ : T d — > [0, 1] is a trigonometric polynomial, is P-^ a block factor? 

Question 9.6. Given an ordering IT on Z d and an increasing sequence of finite sets H n C IT 
whose union is all of IT, when is there phase multiplicity in the sense that 

lim P f [r](0) = 1 | 77f(n\n n ) =1] ^ lim P / [r ? (0) = 1 | r;t(n\n n ) = 0]? 

n — >oo n — >oo 

This clearly does not depend on the choice of II n . In one dimension, this is the same as 
1-sided phase multiplicity, where we know the answer. 

Question 9.7. Suppose that d = 1. Note that translation and flip of / yield the same 
measure PA Does P-^ determine / up to translation and flip? 

We now ask about some properties that are important for models in statistical physics. 



Let Z be as in Definition |5.15| . A stationary process v is called quasi-local if v[r){0) = 1 | 
Z] has a continuous version, where the product topology is used on 2 Z \{°I. It is called 
almost surely quasi-local if v[r)(0) = 1 | Z] has an almost surely continuous version. 
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Question 9.8. For which / is P-^ quasi-local or almost surely quasi-local? When there 
is phase multiplicity, then P-^ is as far as possible from this since then each version of 
Pf[r)(Q) = 1 | Z] is nowhere continuous. In one dimension, there is a natural definition of 
one-sided quasi-locality, which is the same as what is sometimes referred to as the uniform 
martingale property (see Kalikow ( 199C| )). It is not hard to see that this property, or even 



its almost sure version, would imply an affirmative answer to Question 

We say that a stationary process v on 2 Z is insertion tolerant if v[rj(0) = 1 | Z\ > 
i/-a.s. Similarly, v is deletion tolerant if u[r)(0) = | Z] > z^-a.s. 

Conjecture 9.9. Every P-f is insertion and deletion tolerant, other than the trivial cases 
f = and f = 1. 



Theorem |5.16| implies a positive solution to Conjecture ^T9| when l/[f(l — /)] is inte- 



grable. Indeed, we see by Theorem |5.16| that this is the exact criterion for uniform insertion 



tolerance and uniform deletion tolerance. 
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